FIGURE 1



GGGGCTTCGGCGCCAGCGGCCAGCGCTAGTCGGTCTGGTAAGGATTTACAAAAGGTGCAGGTATG 
AGCAGGTCTGAAGACTAACATTTTGTGAAGTTGTAAAACAGAAAACCTGTTAGAAATGTGGTGGT 
TTCAGCAAGGCCTCAGTTTCCTTCCTTCAGCCCTTGTAATTTGGACATCTGCTGCTTTCATATTT 
TCATACATTACTGCAGTAACACTCCACCATATAGACCCGGC TTTACCT TATATCAGTGACAC TGG 
TACAGTAGC TCCAGAAAAAT GCT TAT T T GGGGCAATGCTAAATAT T GCGGCAGT T T TATGCAT TG 
CTACCATTTATGTTCGTTATAAGCAAGTTCATGCTCTGAGTCCTGAAGAGAACGTTATCATCAAA 
TTAAACAAGGCTGGCCTTGTACTTGGAATACTGAGTTGTTTAGGACTTTCTATTGTGGCAAACTT 
CCAGAAAACAACCCTTTTTGCTGCACATGTAAGTGGAGCTGTGCTTACCTTTGGTATGGGCTCAT 
TATATATGTTTGTTCAGACCATCCTTTCCTACCAAATGCAGCCCAAAATCCATGGCAAACAAGTC 
TTCTGGATCAGACTGTTGTTGGTTATCTGGTGTGGAGTAAGTGCACTTAGCATGCTGACTTGCTC 
AT CAGT TT TGCACAGTGGCAATT T T GGGAC TGAT T T AGAACAGAAACT CCAT T GGAACCCCGAGG 
ACAAAGGTTATGTGCTTCACATGATCACTACTGCAGCAGAATGGTCTATGTCATTTTCCTTCTTT 
GGTTTTTTCCTGACTTACATTCGTGATTTTCAGAAAATTTCTTTACGGGTGGAAGCCAATTTACA 
TGGATTAACCCTCTATGACACTGCACCTTGCCCTATTAACAATGAACGAACACGGCTACTTTCCA 
GAGATAT T TGA TGAAAGGATAAAATAT TTCTGTAATGATTATGAT TCTCAGGGATTGGGGAAAGG 
T TCACAGAAG T T GC TTAT TC T TC T CT GAAAT T T T CAACCAC T T AAT CAAGGC TGACAG TAACACT 
GAT G AAT GCT G AT AAT C AGG AAAC AT GAAAGAAG C C AT T T GAT AG AT TAT T C T AAAG GAT AT CAT 
CAAGAAGACTAT TAAAAACACCTATGCCTATACTT TT TTATCTCAGAAAATAAAGTCAAAAGACT 
ATG 



10063568 






FIGURE 2 

<subunit 1 of 1, 266 aa, 1 stop 
<MW: 29766, pI: 8.39, NX(S/T): 0

MWWFQQGLSFLPSALVIWTSAAFIFSYITAVTLHHIDPALPYISDTGTVAPEKCLFGAMLNIAAV
LCIATIYVRYKQVHALSPEENVIIKLNKAGLVLGILSCLGLSIVANFQKTTLFAAHVSGAVLTFG
MGSLYMFVQTILSYQMQPKIHGKQVFWIRLLLVIWCGVSALSMLTCSSVLHSGNFGTDLEQKLHW 
NPEDKGYVLHMITTAAEWSMSFSFFGFFLTYIRDFQKISLRVEANLHGLTLYDTAPCPINNERTR 
LLSRDI 

Important features: 

Type II transmembrane domain: 

amino acids 13-33 

Other Transmembrane domains: 

amino acids 54-73, 94-113, 160-180, 122-141 

N-myristoylation sites. 

amino acids 57-63, 95-101, 99-105, 124-130, 183-189 
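The amino acid sequence in FIGURE 2 can be checked against the nucleotide sequence in FIGURE 1 by translating the open reading frame. The following is a minimal sketch, assuming the standard genetic code; the names `CODON_TABLE`, `translate` and `orf_start` are illustrative and do not come from the figures, and the excerpt is hand-copied from the start of the coding region printed in FIGURE 1.

```python
# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna, to_stop=True):
    """Translate a DNA string codon by codon, starting at its first base."""
    dna = "".join(dna.split()).upper()  # drop spacing left by text extraction
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[pos:pos + 3], "X")  # X marks an unreadable codon
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the coding region, copied from FIGURE 1 (assumed excerpt).
orf_start = "ATGTGGTGGT TTCAGCAAGGCCTCAGTTTC"
print(translate(orf_start))  # MWWFQQGLSF
```

The printed residues match the first ten residues listed in FIGURE 2. Codons containing characters other than A, C, G or T are reported as `X` rather than guessed, which is a design choice for working with extraction-damaged text.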
FIGURE 3



CGGACGCGTGGGCGGACGCGTGGGGGAGAGCCGCAGTCCCGGCTGCAGCACCTGGGAGAAGGCAGACC 
GTGTGAGGGGGCCTGTGGCCCCAGCGTGCTGTGGCCTCGGGGAGTGGGAAGTGGAGGCAGGAGCCTTC 
CTT ACACT TCGCCATGAGT TT CCT CATC GAC TCCAGCAT CATGAT TACCTCCCAGATACTATTT T TTG 
GAT TTGGGTGGCT TTTCTTCATGCGCCAATT GTTTAAAGACTATGAGATACGTCAGTATGTTGTACAG 
GTGATCTTCTCCGTGACGTTTGCATTTTCTTGCACCATGTTTGAGCTCATCATCTTTGAAATCTTAGG 
AGTATTGAATAGCAGCTCCCGTTATTTTCACTGGAAAATGAACCTGTGTGTAATTCTGCTGATCCTGG 
TTTTCATGGTGCCTTTTTACATTGGCTATTTTATTGTGAGCAATATCCGACTACTGCATAAACAACGA 
CTGCTTTTTTCCTGTCTCTTATGGCTGACCTTTATGTATTTCTTCTGGAAACTAGGAGATCCCTTTCC 
CATTCTCAGCCCAAAACATGGGATCTTATCCATAGAACAGCTCATCAGCCGGGTTGGTGTGATTGGAG

TGACTCTCATGGCTCTTCTTTCTGGATTTGGTGCTGTCAACTGCCCATACACTTACATGTCTTACTTC 
CTCAGGAATGTGACTGACACGGATATTCTAGCCCTGGAACGGCGACTGCTGCAAACCATGGATATGAT

CATAAGCAAAAAGAAAAGGAT GGCAAT GGCACGGAGAACAATGT T CCAGAAGGGGGAAGTGCATAACA 
AACCATCAGGTTTCTGGGGAATGATAAAAAGTGTTACCACTTCAGCATCAGGAAGTGAAAATCTTACT
CTTATTCAACAGGAAGTGGATGCTTTGGAAGAATTAAGCAGGCAGCTTTTTCTGGAAACAGCTGATCT
ATATGCTACCAAGGAGAGAATAGAATACTCCAAAACCTTCAAGGGGAAATATTTTAATTTTCTTGGTT
ACTTTTTCTCTATTTACTGTGTTTGGAAAATTTTCATGGCTACCATCAATATTGTTTTTGATCGAGTT
GGGAAAACGGAT CCTGTCACAAGAGGCATTGAGAT CACT GT GAAT TAT CTGGGAATCCAAT TTGATGT 


GAAGT T T T G GT C CCAAC AC AT TT C C T T CATT C T T GT T GGAATAAT CAT CGT C ACAT C CAT CAGAGGAT 
TGCTGATCACTCTTACCAAGTTCTTTTATGCCATCTCTAGCAGTAAGTCCTCCAATGTCATTGTCCTG
CTATTAGCACAGATAATGGGCATGTACTTTGTCTCCTCTGTGCTGCTGATCCGAATGAGTATGCCTTT 
AGAATACC GCACCATAATCACTGAAGTC CTT GGAGAACT GCAGT T CAACTTCTAT CACCGTTGGT TTG 
ATGTGATCTTCCTGGTCAGCGCTCTCTCTAGCATACTCTTCCTCTATTTGGCTCACAAACAGGCACCA 
GAGAAGCAAATGGCACCTTGAACTTAAGCCTACTACAGACTGTTAGAGGCCAGTGGTTTCAAAATTTA 
GATATAAGAGGGGGGAAAAATGGAACCAGGGCCTGACATTTTATAAACAAACAAAATGCTATGGTAGC 
ATTTTTCACCTTCATAGCATACTCCTTCCCCGTCAGGTGATACTATGACCATGAGTAGCATCAGCCAG 
AACAT GAGAGGGAGAACTAACTCAAGACAATACTCAGCAGAGAGCAT CCCGTGTGGATATGAGGCTGG 
TGTAGAGGCGGAGAGGAGCCAAGAAACTAAAGGTGAAAAATACACTGGAACTCTGGGGCAAGACATGT 
CTATGGTAGCT GAGCCAAACACGTAGGAT TTCCGTTT TAAGGT T CACATGGAAAA.GGTTATAGCTTT G 
CCTTGAGATTGACTCATTAAAATCAGAGACTGTAACAAAAA?^AAAAAAAAAAAAAAGGGCGGCCGCG 
ACTCTAGAGTCGACCTGCAGAAGCTTGGCCGCCATGGCCCAACTTGTTTATTGCAGCTTATAATG 
FIGURE 4

MSFLIDSSIMITSQILFFGFGWLFFMRQLFKDYEIRQYVVQVIFSVTFAFSCTMFELIIFEILGV
LNSSSRYFHWKMNLCVILLILVFMVPFYIGYFIVSNIRLLHKQRLLFSCLLWLTFMYFFWKLGDP
FPILSPKHGILSIEQLISRVGVIGVTLMALLSGFGAVNCPYTYMSYFLRNVTDTDILALERRLLQ
TMDMIISKKKRMAMARRTMFQKGEVHNKPSGFWGMIKSVTTSASGSENLTLIQQEVDALEELSRQ
LFLETADLYATKERIEYSKTFKGKYFNFLGYFFSIYCVWKIFMATINIVFDRVGKTDPVTRGIEI
TVNYLGIQFDVKFWSQHISFILVGIIIVTSIRGLLITLTKFFYAISSSKSSNVIVLLLAQIMGMY
FVSSVLLIRMSMPLEYRTIITEVLGELQFNFYHRWFDVIFLVSALSSILFLYLAHKQAPEKQMAP

Important features: 
Signal peptide: 

amino acids 1-23 

Potential transmembrane domains: 

amino acids 37-55, 81-102, 150-168, 288-311, 338-356, 375-398, 
425-444 

N-glycosylation sites. 

amino acids 67-70, 180-183 and 243-246 

Eukaryotic cobalamin-binding proteins 

amino acids 151-160 
FIGURE 5



AGCAGGGAAATCCGGATGTCTCGGTTATGAAGTGGAGCAGTGAGTGTGAGCCTCAACATAGTTCC 
AGAACTCTCCATCCGGACTAGTTATTGAGCATCTGCCTCTCATATCACCAGTGGCCATCTGAGGT 
GTTTCCCTGGCTCTGAAGGGGTAGGCACG ATG GCCAGGTGCTTCAGCCTGGTGTTGCTTCTCACT 
TCCATCTGGACCACGAGGCTCCTGGTCCAAGGCTCTTTGCGTGCAGAAGAGCTTTCCATCCAGGT 
GTCATGCAGAATTATGGGGATCACCCTTGTGAGCAAAAAGGCGAACCAGCAGCTGAATTTCACAG 
AAGCTAAGGAGGCCTGTAGGCTGCTGGGACTAAGTTTGGCCGGCAAGGACCAAGTTGAAACAGCC 
TTGAAAGCTAGCTTTGAAACTTGCAGCTATGGCTGGGTTGGAGATGGATTCGTGGTCATCTCTAG 
GATTAGCCCAAACCCCAAGTGTGGGAAAAATGGGGTGGGTGTCCTGATTTGGAAGGTTCCAGTGA 
GCCGACAGT T T GCAGCC TAT T GT TACAAC TCATCT GATAC TT GGACTAAC TCGT GCATT CCAGAA 
ATTATCACCACCAAAGATCCCATATTCAACACTCAAACTGCAACACAAACAACAGAATTTATTGT
CAGTGACAGTACCTACTCGGTGGCATCCCCTTACTCTACAATACCTGCCCCTACTACTACTCCTC
CTGCTCCAGCTTCCACTTCTATTCCACGGAGAAAAAAATTGATTTGTGTCACAGAAGTTTTTATG
GAAACTAGCACCATGTCTACAGAAACTGAACCATTTGTTGAAAATAAAGCAGCATTCAAGAATGA
AGCTGCTGGGTTTGGAGGTGTCCCCACGGCTCTGCTAGTGCTTGCTCTCCTCTTCTTTGGTGCTG
CAGCTGGTCTTGGATTTTGCTATGTCAAAAGGTATGTGAAGGCCTTCCCTTTTACAAACAAGAAT
CAGCAGAAGGAAATGATCGAAACCAAAGTAGTAAAGGAGGAGAAGGCCAATGATAGCAACCCTAA
TGAGGAATCAAAGAAAA.CTGATAAAAACCCAGAAGAGT CCAAGAGT CCAAGCAAAAC TACCGT GC 
GATGCCTGGAAGCTGAAGTTTAGATGAGACAGAAATGAGGAGACACACCTGAGGCTGGTTTCTTT
CATGCTCCTTACCCTGCCCCAGCTGGGGAAATCAAAAGGGCCAAAGAACCAAAGAAGAAAGTCCA
CCCTTGGTTCCTAACTGGAATCAGCTCAGGACTGCCATTGGACTATGGAGTGCACCAAAGAGAAT
GCCCTTCTCCTTATTGTAACCCTGTCTGGATCCTATCCTCCTACCTCCAAAGCTTCCCACGGCCT
TTCTAGCCTGGCTATGTCCTAATAATATCCCACTGGGAGAAAGGAGTTTTGCA7VAGTGCAAGGAC 
CTAAAACATCTCATCAGTATCCAGTGGTAAAAAGGCCTCCTGGCTGTCTGAGGCTAGGTGGGTTG
AAAGCCAAGGAGTCACTGAGACCAAGGCTTTCTCTACTGATTCCGCAGCTCAGACCCTTTCTTCA
GCTCTGAAAGAGAAACACGTATCCCACCTGACATGTCCTTCTGAGCCCGGTAAGAGCAAAAGAAT
GGCAGAAAAGTTTAGCCCCTGAAAGCCATGGAGATTCTCATAACTTGAGACCTAATCTCTGTAAA

GC TAAAATAAAGAAATAGAACAAGGCTGAGGAT AC GACAGTACAC T GTCAGCAGGGACTG TAAAC 
ACAGACAGGGTCAAAGTGTTTTCTCTGAACACATTGAGTTGGAATCACTGTTTAGAACACACACA

CTTACTTTTTCTGGTCTCTACCACTGCTGATATTTTCTCTAGGAAATATACTTTTACAAGTAACA 
AAAAT AAAAAC TC T T ATAAAT T T C TAT T T T TAT CT GAGT TACAGAAAT GAT TAC T AAGGAAGAT T 
ACT CAG TAAT T TGT T TAAAAAGTAATAAAAT T CAACAAACAT TT GC TGAATAGC TAC TATATGT C 
AAG TGC T GT GCAAGGTAT TACAC T CT GTAAT T GAATAT TATT CCTCAAAAAATT GCACATAGTAG 
AACGCTATC TGGGAAGC TAT T T T T T T CAGTT T T GAT AT TTCTAGC T T ATC TACT TCCAAAC TAAT 
T TT TAT T T T T G CT GAGACTAAT C T TAT TCAT T T TC T CTAATATGGCAACCAT TATAACC T TAAT T 
TAT TATTAACATACCTAAGAAGTACATTGTTACCT CTATATACCAAAGCACATT TTAAAAGTGCC 
ATTAACAAATGTATCACTAGCCCTCCTTTTTCCAAC7\AGAAGGGACTGAGAGATGCAGAAATATT 
TGT G AC AAAAAAT T AAAG CAT T T AGAAAAC T T 
FIGURE 6



MARCFSLVLLLTSIWTTRLLVQGSLRAEELSIQVSCRIMGITLVSKKANQQLNFTEAKEACRLLG
LSLAGKDQVETALKASFETCSYGWVGDGFVVISRISPNPKCGKNGVGVLIWKVPVSRQFAAYCYN
SSDTWTNSCIPEIITTKDPIFNTQTATQTTEFIVSDSTYSVASPYSTIPAPTTTPPAPASTSIPR
RKKLICVTEVFMETSTMSTETEPFVENKAAFKNEAAGFGGVPTALLVLALLFFGAAAGLGFCYVK
RYVKAFPFTNKNQQKEMIETKVVKEEKANDSNPNEESKKTDKNPEESKSPSKTTVRCLEAEV

Signal sequence: 

amino acids 1-16 

Transmembrane domain: 

amino acids 235-254 

N-glycosylation site.

amino acids 53-57, 130-134, 289-293 

Casein kinase II phosphorylation site. 

amino acids 145-149, 214-218 

Tyrosine kinase phosphorylation site. 

amino acids 79-88 

N-myristoylation site. 

amino acids 23-29, 65-71, 234-240, 235-239, 249-255, 253-259 
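The N-glycosylation sites listed above can be located by scanning the amino acid sequence for the N-X-(S/T) sequon. The sketch below assumes the PROSITE-style form of the pattern (N, then any residue except P, then S or T, then any residue except P); the figure does not state which tool produced its site list, and the names `SEQUON` and `n_glycosylation_starts` are illustrative. The test string is the first 65 residues of FIGURE 6.

```python
import re

# Assumed PROSITE-style sequon: N-{P}-[ST]-{P}.
# The lookahead allows overlapping sites to be reported.
SEQUON = re.compile(r"(?=N[^P][ST][^P])")


def n_glycosylation_starts(protein):
    """Return the 1-based positions of the N residue of each sequon."""
    protein = "".join(protein.split()).upper()  # tolerate extraction spacing
    return [m.start() + 1 for m in SEQUON.finditer(protein)]


# First 65 residues of the FIGURE 6 sequence.
figure6_start = "MARCFSLVLLLTSIWTTRLLVQGSLRAEELSIQVSCRIMGITLVSKKANQQLNFTEAKEACRLLG"
print(n_glycosylation_starts(figure6_start))  # [53]
```

The single hit starts at residue 53, which agrees with the first N-glycosylation entry in the list. The figure reports that site as 53-57 while the four-residue pattern itself covers 53-56, so only start positions are compared here.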
FIGURE 7



CGCCGCGCTCCCGCACCCGCGGCCCGCCCACCGCGCCGCTCCCGCATCTGCACCCGCAGCCCGGC 
GGCCTCCCGGCGGGAGCGAGCAGATCCAGTCCGGCCCGCAGCGCAACTCGGTCCAGTCGGGGCGG 
CGGCTGCGGGCGCAGAGCGGAGATGCAGCGGCTTGGGGCCACCCTGCTGTGCCTGCTGCTGGCGG 
CGGCGGTCCCCACGGCCCCCGCGCCCGCTCCGACGGCGACCTCGGCTCCAGTCAAGCCCGGCCCG 
GC T CTCAGC TACC CGCAGGAGGAGGCCACCC TCAATGAGAT G T TCCGCGAGGT T GAGGAAC T GAT 
GGAGGACAC GCAGCACAAAT TGCGCAGC GCGGT GGAAGAGATGGAGGCAGAAGAAGCT GC TGCTA 
AAGCATCATCAGAAGTGAACCTGGCAAACTTACCT CCCAGCTATCACAATGAGACCAACACAGAC 
ACGAAGGTT GGAAATAAT ACCAT CCAT G TGCACCGAGAAAT TCACAAGATAACCAACAACCAGAC 
TGGACAAATGGTCTTTTCAGAGACAGTTATCACATCTGTGGGAGACGAAGAAGGCAGAAGGAGCC

ACGAGTGCATCATCGACGAGGACTGTGGGCCCAGCATGTACTGCCAGTTTGCCAGCTTCCAGTAC
ACCTGCCAGCCATGCCGGGGCCAGAGGATGCTCTGCACCCGGGACAGTGAGTGCTGTGGAGACCA
GCTGTGTGTCTGGGGTCACTGCACCAAAATGGCCACCAGGGGCAGCAATGGGACCATCTGTGACA
ACCAGAGGGACTGCCAGCCGGGGCTGTGCTGTGCCTTCCAGAGAGGCCTGCTGTTCCCTGTGTGC
ACACCCCTGCCCGTGGAGGGCGAGCTTTGCCATGACCCCGCCAGCCGGCTTCTGGACCTCATCAC
CTGGGAGCTAGAGCCTGATGGAGCCTTGGACCGATGCCCTTGTGCCAGTGGCCTCCTCTGCCAGC
CCCACAGCCACAGCCTGGTGTATGTGTGCAAGCCGACCTTCGTGGGGAGCCGTGACCAAGATGGG
GAGATCCTGCTGCCCAGAGAGGTCCCCGATGAGTATGAAGTTGGCAGCTTCATGGAGGAGGTGCG
CCAGGAGCTGGAGGACCTGGAGAGGAGCCTGACTGAAGAGATGGCGCTGGGGGAGCCTGCGGCTG
CCGCCGCTGCACTGCTGGGAGGGGAAGAGATT TAG ATCTGGACCAGGCTGTGGGTAGATGTGCAA
TAGAAATAGCTAATTTATTTCCCCAGGTGTGTGCTTTAGGCGTGGGCTGACCAGGCTTCTTCCTA 
C^TCTTCTTCCCAGTAAGTTTCCCCTCTGGCTTGACAGCATGAGGTGTTGTGCATTTGTTCAGCT
CCCCCAGGCTGTTCTCCAGGCTTCACAGTCTGGTGCTTGGGAGAGTCAGGCAGGGTTAAACTGCA
GGAGCAGTTTGCCACCCCTGTCCAGATTATTGGCTGCTTTGCCTCTACCAGTTGGCAGACAGCCG
TTTGTTCTACATGGCTTTGATAATTGTTTGAGGGGAGGAGATGGAAACAATGTGGAGTCTCCCTC

TGATTGGTTTTGGGGAAATGTGGAGAAGAGTGCGCTGCTTTGCAAACATCAACCTGGCAAAAATG

CAACAAATGAATTTTCCACGCAGTTCTTTCCATGGGCATAGGT7VAGCTGTGCCTTCAGCTGTTGC 
AGATGAAATGTTCTGTTCACCCTGCATTACATGTGTTTATTCATCCAGCAGTGTTGCTCAGCTCC 
TACCTCTGTGCCAGGGCAGCATTTTCATATCCAAGATCAATTCCCTCTCTCAGCACAGCCTGGGG 
AGGGGGTCATTGTTCTCCTCGTCCATCAGGGATCTCAGAGGCTCAGAGACTGCAAGCTGCTTGCC 
CAAGTCACACAGCTAGTGAAGACCAGAGCAGTTTCATCTGGTTGTGACTCTAAGCTCAGTGCTCT 
CTCCACTACCCCAGACCAGCCTTGGTGGCACCAAAAGTGCTCCCCAAAAGGAAGGAGAATGGGAT 
T T T TC T T GAGGCAT GCACATC TGGAAT TAAGGTCAAAC TAAT T CT CACAT CCCTCTAAAAGTAAA 
CTACTGTTAGGAACAGCAGTGTTCTCACAGTGTGGGGCAGCCGTCCTTCTAATGAAGACAATGAT 
ATTGACACTGTCCCTCTTTGGCAGTTGCATTAGTAACTTTGAAAGGTATATGACTGAGCGTAGCA 
TACAGGT TAACCTGCAGAAACAGT AC T TAGGTAAT TGTAGGGC GAGGAT TATAAATGAAAT TTGC 
AAAATCACTTAGCAGCAACTGAAGACAATTATCAACCACGTGGAGAAAATCAAACCGAGCAGGGC 
TGTGTGAAACATGGTTGTAATATGCGACTGCGAACACTG7\ACTCTACGCCACTCCACA7yiTGATG 
TTTTCAGGTGTCATGGACTGTTGCCACCATGTATTCATCCAGAGTTCTTAAAGTTTAAAGTTGCA 
CATGAT TGTAT AAGCATGC T T T CT T T GAGT T T TAAAT TATG TATAAACATAAGT TGCATT TAGAA 
AT CAAGCATAAAT CAC T T CAAC T GCAAAAAAAAAAAAAAAAAAAAAAAAAA 
FIGURE 8

MQRLGATLLCLLLAAAVPTAPAPAPTATSAPVKPGPALSYPQEEATLNEMFREVEELMEDTQHKL 
RSAVEEMEAEEAAAKASSEVNLANLPPSYHNETNTDTKVGNNTIHVHREIHKITNNQTGQMVFSE
TVITSVGDEEGRRSHECIIDEDCGPSMYCQFASFQYTCQPCRGQRMLCTRDSECCGDQLCVWGHC 
TKMATRGSNGTICDNQRDCQPGLCCAFQRGLLFPVCTPLPVEGELCHDPASRLLDLITWELEPDG 
ALDRCPCASGLLCQPHSHSLVYVCKPTFVGSRDQDGEILLPREVPDEYEVGSFMEEVRQELEDLE
RSLTEEMALGEPAAAAAALLGGEEI

Signal sequence: 

amino acids 1-19 

N-glycosylation site. 

amino acids 96-100, 106-110, 121-125, 204-208 
Casein kinase II phosphorylation site. 

amino acids 46-50, 67-71, 98-102, 135-139, 206-210, 312-316, 
327-331 

N-myristoylation site. 

amino acids 202-208, 217-223 

Amidation site. 

amino acids 140-144 
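The feature lists in these figures give residue ranges that read as 1-based and inclusive. A small helper, sketched below under that assumption, pulls the residues for a listed range out of a sequence so an entry can be inspected directly; the name `feature` is illustrative and the test string is the first 65 residues of FIGURE 8.

```python
def feature(protein, start, end):
    """Return residues start..end, treating the range as 1-based and inclusive."""
    protein = "".join(protein.split())  # tolerate spacing left by text extraction
    if not 1 <= start <= end <= len(protein):
        raise ValueError(f"range {start}-{end} falls outside 1-{len(protein)}")
    return protein[start - 1:end]


# First 65 residues of the FIGURE 8 sequence.
figure8_start = "MQRLGATLLCLLLAAAVPTAPAPAPTATSAPVKPGPALSYPQEEATLNEMFREVEELMEDTQHKL"

signal = feature(figure8_start, 1, 19)   # signal sequence, amino acids 1-19
ck2_site = feature(figure8_start, 46, 50)  # first casein kinase II entry
print(signal, len(signal))  # MQRLGATLLCLLLAAAVPT 19
print(ck2_site)             # TLNEM
```

Raising an error for out-of-range coordinates, rather than silently truncating, makes it easier to notice when a sequence has lost residues during extraction.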
FIGURE 9



CGGACGCGTGGGCGGACGCGTGGGGGCTGTGAGAAAGTGCCAATAAATACATCATGCAACCCCAC 
GGCCCACCTTGTGAACTCCTCGTGCCCAGGGCTGATGTGCGTCTTCCAGGGCTACTCATCCAAAG 
GCCTAATCCAACGTTCTGTCTTCAATCTGCAAATCTATGGGGTCCTGGGGCTCTTCTGGACCCTT 
AACTGGGTACTGGCCCTGGGCCAATGCGTCCTCGCTGGAGCCTTTGCCTCCTTCTACTGGGCCTT 
CCACAAGCCCCAGGACATCCCTACCTTCCCCTTAATCTCTGCCTTCATCCGCACACTCCGTTACC 
ACACTGGGTCATTGGCATTTGGAGCCCTCATCCTGACCCTTGTGCAGATAGCCCGGGTCATCTTG 
GAG TATAT T GACCACAAGC TCAGAGGAG T GCAGAACCC TGTAGCCCGC TGCAT CATGTGC TGT TT 
CAAGTGCTGCCTCTGGTGTCTGGAAAAATTTATCAAGTTCCTAAACCGCAATGCATACATCATGA 
TCGCCATCTACGGGAAGAATTTCTGTGTCTCAGCCAAAAATGCGTTCATGCTACTCATGCGAAAC 
ATTGTCAGGGTGGTCGTCCTGGACAAAGTCACAGACCTGCTGCTGTTCTTTGGGAAGCTGCTGGT 
GGTCGGAGGCGTGGGGGTCCTGTCCTTCTTTTTTTTCTCCGGTCGCATCCCGGGGCTGGGTAAAG 
ACTTTAAGAGCCCCCACCTCAACTATTACTGGCTGCCCATCATGACCTCCATCCTGGGGGCCTAT 
GTCATCGCCAGCGGCTTCTTCAGCGTTTTCGGCATGTGTGTGGACACGCTCTTCCTCTGCTTCCT 
GGAAGACCTGGAGCGGAACAACGGCTCCCTGGACCGGCCCTACTACATGTCCAAGAGCCTTCTAA 
AGAT TC TGGGCAAGAAGAAC GAGGCGC CCCCGGACAACAAGAAGAGGAAGAAGTGACAGC TCCGG 
CCCTGATCCAGGACTGCACCCCACCCCCACCGTCCAGCCATCCAACCTCACTTCGCCTTACAGGT 
CTCCATTTTGTGGTAAAAAAAGGTTTTAGGCCAGGCGCCGTGGCTCACGCCTGTAATCCAACACT 
TTGAGAGGCTGAGGCGGGCGGATCACCTGAGTCAGGAGTTCGAGACCAGCCTGGCCAACATGGTG 
AAACCTCCGTCTCTATTAAAAATACAAAAATTAGCCGAGAGTGGTGGCATGCACCTGTCATCCCA 
GC TAC T CGGGAGGC TGAGGCAGGAGAAT CGC TT GAACC CGGGAGGCAGAGGT TGCAGTGAGCCGA 
GAT CGC GCCACT GCAC T CCAACCT GGG T GACAGACTCTGTCT C CAAAACAAAACAAACAAACAAA 
AAGATT TTATTAAAGATATTTTGTTAACTC 
FIGURE 10



RTRGRTRGGCEKVPINTSCNPTAHLVNSSCPGLMCVFQGYSSKGLIQRSVFNLQIYGVLGLFWTL 
NWVLALGQCVLAGAFASFYWAFHKPQDIPTFPLISAFIRTLRYHTGSLAFGALILTLVQIARVIL
EYIDHKLRGVQNPVARCIMCCFKCCLWCLEKFIKFLNRNAYIMIAIYGKNFCVSAKNAFMLLMRN
IVRVVVLDKVTDLLLFFGKLLVVGGVGVLSFFFFSGRIPGLGKDFKSPHLNYYWLPIMTSILGAY
VIASGFFSVFGMCVDTLFLCFLEDLERNNGSLDRPYYMSKSLLKILGKKNEAPPDNKKRKK 

Important features: 
Transmembrane domains: 

amino acids 57-80 (type II), 110-126, 215-231, 254-274 

N-glycosylation sites. 

amino acids 16-20, 27-31, 289-293 

Hypothetical YBR002c family proteins. 

amino acids 276-288 

Ammonium transporters proteins. 

amino acids 204-231 

N-myristoylation sites. 

amino acids 60-66, 78-84 

Amidation site. 

amino acids 306-310 
FIGURE 11



GCCCCGCGCCCGGCGCCGGGCGCCCGAAGCCGGGAGCCACCGCCATGGGGGCCTGCCTGGGAGCCTGC 
TCCCTGCTCAGCTGCGCGTCCTGCCTCTGCGGCTCTGCCCCCTGCATCCTGTGCAGCTGCTGCCCCGC 
CAGCCGCAACTCCACCGTGAGCCGCCTCATCTTCACGTTCTTCCTCTTCCTGGGGGTGCTGGTGTCCA 
TCATTATGCTGAGCCCGGGCGTGGAGAGTCAGCTCTACAAGCTGCCCTGGGTGTGTGAGGAGGGGGCC 
GGGATCCCCACCGTCCTGCAGGGCCACATCGACTGTGGCTCCCTGCTTGGCTACCGCGCTGTCTACCG 
CATGTGCTTCGCCACGGCGGCCTTCTTCTTCTTCTTTTTCACCCTGCTCATGCTCTGCGTGAGCAGCA 
GCCGGGACCCCCGGGCTGCCATCCAGAATGGGTTTTGGTTCTTTAAGTTCCTGATCCTGGTGGGCCTC
ACCGTGGGTGCCTTCTACATCCCTGACGGCTCCTTCACCAACATCTGGTTCTACTTCGGCGTCGTGGG
CTCCTTCCTCTTCATCCTCATCCAGCTGGTGCTGCTCATCGACTTTGCGCACTCCTGGAACCAGCGGT

GGCTGGGCAAGGCCGAGGAGTGCGATTCCCGTGCCTGGTACGCAGGCCTCTTCTTCTTCACTCTCCTC

TTCTACTTGCTGTCGATCGCGGCCGTGGCGCTGATGTTCATGTACTACACTGAGCCCAGCGGCTGCCA
CGAGGGCAAGGTCTTCATCAGCCTCAACCTCACCTTCTGTGTCTGCGTGTCCATCGCTGCTGTCCTGC
CCAAGGTCCAGGACGCCCAGCCCAACTCGGGTCTGCTGCAGGCCTCGGTCATCACCCTCTACACCATG
TTTGTCACCTGGTCAGCCCTATCCAGTATCCCTGAACAGAAATGCAACCCCCATTTGCCAACCCAGCT

GGGCAACGAGACAGTTGTGGCAGGCCCCGAGGGCTATGAGACCCAGTGGTGGGATGCCCCGAGCATTG 
TGGGCCTCATCATCTTCCTCCTGTGCACCCTCTTCATCAGTCTGCGCTCCTCAGACCACCGGCAGGTG
AACAGCC TGATGCAGACCGAGGAGTGCCCACCTAT GCTAGACGC CACACAGCAGCAGCAGCAGCAGGT 
GGCAGCCTGTGAGGGCCGGGCCTTTGACAACGAGCAGGACGGCGTCACCTACAGCTACTCCTTCTTCC
ACTTCTGCCTGGTGCTGGCCTCACTGCACGTCATGATGACGCTCACCAACTGGTACAAGCCCGGTGAG
ACCCGGAAGATGATCAGCACGTGGACCGCCGTGTGGGTGAAGATCTGTGCCAGCTGGGCAGGGCTGCT 
CCTCTACCTGTGGACCCTGGTAGCCCCACTCCTCCTGCGCAACCGCGACTTCAGCTGAGGCAGCCTCA 
CAGCCTGCCATCTGGTGCCTCCTGCCACCTGGTGCCTCTCGGCTCGGTGACAGCCAACCTGCCCCCTC 
CCCACACCAATCAGCCAGGCTGAGCCCCCACCCCTGCCCCAGCTCCAGGACCTGCCCCTGAGCCGGGC 
CTTCTAGTCGTAGTGCCTTCAGGGTCCGAGGAGCATCAGGCTCCTGCAGAGCCCCATCCCCCCGCCAC 
ACCCACACGGTGGAGCTGCCTCTTCCTTCCCCTCCTCCCTGTTGCCCATACTCAGCATCTCGGATGAA 
AGGGCTCCCTTGTCCTCAGGCTCCACGGGAGCGGGGCTGCTGGAGAGAGCGGGGAACTCCCACCACAG 
TGGGGCATCCGGCACTGAAGCCCTGGTGTTCCTGGTCACGTCCCCCAGGGGACCCTGCCCCCTTCCTG 
GACTTCGTGCCTTACTGAGTCTCTAAGACTTTTTCTAATAAACAAGCCAGTGCGTGTAAAAAAAA 
FIGURE 12

MGACLGACSLLSCASCLCGSAPCILCSCCPASRNSTVSRLIFTFFLFLGVLVSIIMLSPGVESQL
YKLPWVCEEGAGIPTVLQGHIDCGSLLGYRAVYRMCFATAAFFFFFFTLLMLCVSSSRDPRAAIQ 
NGFWFFKFLILVGLTVGAFYIPDGSFTNIWFYFGVVGSFLFILIQLVLLIDFAHSWNQRWLGKAE
ECDSRAWYAGLFFFTLLFYLLSIAAVALMFMYYTEPSGCHEGKVFISLNLTFCVCVSIAAVLPKV 
QDAQPNSGLLQASVITLYTMFVTWSALSSIPEQKCNPHLPTQLGNETVVAGPEGYETQWWDAPSI
VGLIIFLLCTLFISLRSSDHRQVNSLMQTEECPPMLDATQQQQQQVAACEGRAFDNEQDGVTYSY 

SFFHFCLVLASLHVMMTLTNWYKPGETRKMISTWTAVWVKICASWAGLLLYLWTLVAPLLLRNRD
FS

Signal sequence:

amino acids 1-20

Transmembrane domains:

amino acids 40-58, 101-116, 134-150, 162-178, 206-223, 240-257,
272-283, 324-340, 391-406, 428-444
FIGURE 13



CGGGCCAGCCTGGGGCGGCCGGCCAGGAACCACCCGTTAAGGTGTCTTCTCTTTAGGGATGGTGA 
GGT TGGAAAAAGAC TCC TGTAACCCTCCTCCAGG ATGA ACCACCT GCCAGAAGACAT GGAGAAC G 
CTCTCACCGGGAGCCAGAGCTCCCATGCTTCTCTGCGCAATATCCATTCCATCAACCCCACACAA 
CTCATGGCCAGGATTGAGTCCTATGAAGGAAGGGAAAAGAAAGGCATATC TGATGT CAGGAGGAC 
TTTCTGTTTGTTTGTCACCTTTGACCTCTTATTCGTAACATTACTGTGGATAATAGAGTTAAATG 
T GAATGGAGGCAT TGAGAACACAT T AGAGAAGGAGGTGATGCAGTATGAC TAC T AT TCTTCATAT 
TTTGATATATTTCTTCTGGCAGTTTTTCGATTTAAAGTGTTAATACTTGCATATGCTGTGTGCAG 

ACTGCGCCATTGGTGGGCAATAGCGTTGACAACGGCAGTGACCAGTGCCTTTTTACTAGCAAAAG
TGATCCTTTCGAAGCTTTTCTCTCAAGGGGCTTTTGGCTATGTGCTGCCCATCATTTCATTCATC
CTTGCCTGGATTGAGACGTGGTTCCTGGATTTCAAAGTGTTACCTCAAGAAGCAGAAGAAGAAAA

CAGACTCCTGATAGTTCAGGATGCTTCAGAGAGGGCAGCACTTATACCTGGTGGTCTTTCTGATG
GTCAGTTTTATTCCCCTCCTGAATCCGAAGCAGGATCTGAAGAAGCTGAAGAAAAACAGGACAGT


GAGAAACCAC T T T TAGAACTATGAGTACTAC T T T T GT TAAAT GT GAAAAACCCT CACAGAAAGT C 

ATCGAGGCAAAAAGAGGCAGGCAGT GGAGT C T CCC TGTC GACAGTAAAG T TGAAAT GGTGACGT C 
CACTGCTGGCTTTATTGAACAGCTAATAAAGATTTATTTATTGTAATACCTCACAAACGTTGTAC

CATATCCATGCACATTTAGTTGCCTGCCTGTGGCTGGTAAGGTAATGTCATGATTCATCCTCTCT


TCAGTGAGACTGAGCCTGATGTGTTAACAAATAGGTGAAGAAAGTCTTGTGCTGTATTCCTAATC 
7\AAAGAC TTAATATATTGAAGTAACAC TTTT TTAGTAAGCAAGATACCT TTTTATT TCAAT TCAC

AGAATGGAAT T TT T T TGT T TCAT GT CT CAGAT T TAT T T TGTAT T T CTT T T T TAACAC TCTACAT T 
TCCCTTGTTTTTTAACTCATGCACATGTGCTCTTTGTACAGTTTTAAAAAGTGTAATAAAATCTG 
ACATGTCAATGTGGCTAGTTTTATTTTTCTTGTTTTGCATTATGTGTATGGCCTGAAGTGTTGGA 
CTT GCAAAAGGGGAAGAAAGGAAT T GC GAATACAT GTAAAAT GT CACCAGACAT T T GT AT TAT T T 
T TATCATGAAATCATG T TT T T CT CT GAT TGT T C T GAAATGT T CTAAATAC TC T TAT T T TGAAT GC 
ACAAAATGAC T TAAACCAT TCATAT CATGTT T CCT T TGCGT T CAGCCAAT T T CAAT TAAAATGAA 
C TAAAT TAAAAA 
FIGURE 14

MNHLPEDMENALTGSQSSHASLRNIHSINPTQLMARIESYEGREKKGISDVRRTFCLFVTFDLLF
VTLLWIIELNVNGGIENTLEKEVMQYDYYSSYFDIFLLAVFRFKVLILAYAVCRLRHWWAIALTT 
AVTSAFLLAKVILSKLFSQGAFGYVLPIISFILAWIETWFLDFKVLPQEAEEENRLLIVQDASER 
AALIPGGLSDGQFYSPPESEAGSEEAEEKQDSEKPLLEL 

Important features of the protein: 
Signal peptide: 

amino acids 1-20 

Transmembrane domains: 

amino acids 54-72, 100-118, 130-144, 146-166 
N-myristoylation sites. 

amino acids 14-20, 78-84, 79-85, 202-208, 217-223 
FIGURE 15



ACT CGAAC GCAG T T GCT T CGGGAC C CAGGAC CCCCT CGGGCCCGAC CCGC CAGGAAAGAC T GAGG 
CCGCGGCCTGCCCCGCCCGGCTCCCTGCGCCGCCGCCGCCTCCCGGGACAGAAGATGTGCTCCAG 
GGTCCCTCTGCTGCTGCCGCTGCTCCTGCTACTGGCCCTGGGGCCTGGGGTGCAGGGCTGCCCAT 
CCGGCTGCCAGTGCAGCCAGCCACAGACAGTCTTCTGCACTGCCCGCCAGGGGACCACGGTGCCC 
CGAGACGTGCCACCCGACACGGTGGGGCTGTACGTCTTTGAGAACGGCATCACCATGCTCGACGC 
AGGCAGCTTTGCCGGCCTGCCGGGCCTGCAGCTCCTGGACCTGTCACAGAACCAGATCGCCAGCC 
TGCCCAGCGGGGTCTTCCAGCCACTCGCCAACCTCAGCAACCTGGACCTGACGGCCAACAGGCTG 
CATGAAATCACCAATGAGACCTTCCGTGGCCTGCGGCGCCTCGAGCGCCTCTACCTGGGCAAGAA 
CCGCATCCGCCACATCCAGCCTGGTGCCTTCGACACGCTCGACCGCCTCCTGGAGCTCAAGCTGC 
AGGACAACGAGCTGCGGGCACTGCCCCCGCTGCGCCTGCCCCGCCTGCTGCTGCTGGACCTCAGC
CACAACAGCCTCCTGGCCCTGGAGCCCGGCATCCTGGACACTGCCAACGTGGAGGCGCTGCGGCT
GGCTGGTCTGGGGCTGCAGCAGCTGGACGAGGGGCTCTTCAGCCGCTTGCGCAACCTCCACGACC
TGGATGTGTCCGACAACCAGCTGGAGCGAGTGCCACCTGTGATCCGAGGCCTCCGGGGCCTGACG
CGCCTGCGGCTGGCCGGCAACACCCGCATTGCCCAGCTGCGGCCCGAGGACCTGGCCGGCCTGGC 
TGCCCTGCAGGAGCTGGATGTGAGCAACCTAAGCCTGCAGGCCCTGCCTGGCGACCTCTCGGGCC
TCTTCCCCCGCCTGCGGCTGCTGGCAGCTGCCCGCAACCCCTTCAACTGCGTGTGCCCCCTGAGC
TGGTTTGGCCCCTGGGTGCGCGAGAGCCACGTCACACTGGCCAGCCCTGAGGAGACGCGCTGCCA
CTTCCCGCCCAAGAACGCTGGCCGGCTGCTCCTGGAGCTTGACTACGCCGACTTTGGCTGCCCAG 
CCACCACCACCACAGCCACAGTGCCCACCACGAGGCCCGTGGTGCGGGAGCCCACAGCCTTGTCT
TCTAGCTTGGCTCCTACCTGGCTTAGCCCCACAGCGCCGGCCACTGAGGCCCCCAGCCCGCCCTC
CACTGCCCCACCGACTGTAGGGCCTGTCCCCCAGCCCCAGGACTGCCCACCGTCCACCTGCCTCA
ATGGGGGCACATGCCACCTGGGGACACGGCACCACCTGGCGTGCTTGTGCCCCGAAGGCTTCACG
GGCCTGTACTGTGAGAGCCAGATGGGGCAGGGGACACGGCCCAGCCCTACACCAGTCACGCCGAG
GCCACCACGGTCCCTGACCCTGGGCATCGAGCCGGTGAGCCCCACCTCCCTGCGCGTGGGGCTGC
AGCGCTACCTCCAGGGGAGCTCCGTGCAGCTCAGGAGCCTCCGTCTCACCTATCGCAACCTATCG
GGCCCTGATAAGCGGCTGGTGACGCTGCGACTGCCTGCCTCGCTCGCTGAGTACACGGTCACCCA
GCTGCGGCCCAACGCCACTTACTCCGTCTGTGTCATGCCTTTGGGGCCCGGGCGGGTGCCGGAGG
GCGAGGAGGCCTGCGGGGAGGCCCATACACCCCCAGCCGTCCACTCCAACCACGCCCCAGTCACC
CAGGCCCGCGAGGGCAACCTGCCGCTCCTCATTGCGCCCGCCCTGGCCGCGGTGCTCCTGGCCGC 
GCTGGCTGCGGTGGGGGCAGCGTACTGTGTGCGGCGGGGGCGGGCCATGGCAGCAGCGGCTCAGG 
ACAAAGGGCAGGTGGGGCCAGGGGCTGGGCCCCTGGAACTGGAGGGAGTGAAGGTCCCCTTGGAG 
CCAGGCCCGAAGGCAACAGAGGGCGGTGGAGAGGCCCTGCCCAGCGGGTCTGAGTGTGAGGTGCC 
ACTCATGGGCTTCCCAGGGCCTGGCCTCCAGTCACCCCTCCACGCAAAGCCCTACATCTAAGCCA 
GAGAGAGACAGGGCAGCTGGGGCCGGGCTCTCAGCCAGTGAGATGGCCAGCCCCCTCCTGCTGCC 
ACACCACGTAAGTTCTCAGTCCCAACCTCGGGGATGTGTGCAGACAGGGCTGTGTGACCACAGCT 
GGGCCCTGTTCCCTCTGGACCTCGGTCTCCTCATCTGTGAGATGCTGTGGCCCAGCTGACGAGCC 
CTAACGTCCCCAGAACCGAGTGCCTATGAGGACAGTGTCCGCCCTGCCCTCCGCAACGTGCAGTC 
CCTGGGCACGGCGGGCCCTGCCATGTGCTGGTAACGCATGCCTGGGTCCTGCTGGGCTCTCCCAC 
TCCAGGCGGACCCTGGGGGCCAGTGAAGGAAGCTCCCGGAAAGAGCAGAGGGAGAGCGGGTAGGC 
GGCTGTGTGACTCTAGTCTTGGCCCCAGGAAGCGAAGGAACAAAAGAAACTGGAAAGGAAGATGC 
TT TAGGAACATGTT TTGC TTT TTTAAAATATATATATTTATAAGAGATCC TT TCCCAT TTATTCT 
GGGAAGATGT TT TTCAAACTCAGAGACAAGGACTT TGGT TT T T GTAAGACAAACGATGATATGAA 
GGCCTTTTG T AAG AAAAAAT AAAAG AT G AAG T G T G AAA 
FIGURE 16

MCSRVPLLLPLLLLLALGPGVQGCPSGCQCSQPQTVFCTARQGTTVPRDVPPDTVGLYVFENGIT 
MLDAGSFAGLPGLQLLDLSQNQIASLPSGVFQPLANLSNLDLTANRLHEITNETFRGLRRLERLY 
LGKNRIRHIQPGAFDTLDRLLELKLQDNELRALPPLRLPRLLLLDLSHNSLLALEPGILDTANVE 
ALRLAGLGLQQLDEGLFSRLRNLHDLDVSDNQLERVPPVIRGLRGLTRLRLAGNTRIAQLRPEDL 
AGLAALQELDVSNLSLQALPGDLSGLFPRLRLLAAARNPFNCVCPLSWFGPWVRESHVTLASPEE 
TRCHFPPKNAGRLLLELDYADFGCPATTTTATVPTTRPVVREPTALSSSLAPTWLSPTAPATEAP
SPPSTAPPTVGPVPQPQDCPPSTCLNGGTCHLGTRHHLACLCPEGFTGLYCESQMGQGTRPSPTP 
VTPRPPRSLTLGIEPVSPTSLRVGLQRYLQGSSVQLRSLRLTYRNLSGPDKRLVTLRLPASLAEY 
TVTQLRPNATYSVCVMPLGPGRVPEGEEACGEAHTPPAVHSNHAPVTQAREGNLPLLIAPALAAV 
LLAALAAVGAAYCVRRGRAMAAAAQDKGQVGPGAGPLELEGVKVPLEPGPKATEGGGEALPSGSE
CEVPLMGFPGPGLQSPLHAKPYI 

Important features: 
Signal peptide: 

amino acids 1-23 
Transmembrane domain: 

amino acids 579-599 

EGF-like domain cysteine pattern signature. 

amino acids 430-442 
Leucine zipper pattern.
amino acids 197-219, 269-291 
N-glycosylation sites. 

amino acids 101-105, 117-121, 273-277, 500-504, 528-532 
Tyrosine kinase phosphorylation sites.
amino acids 124-131, 337-345 
N-myristoylation sites. 

amino acids 23-29, 27-33, 70-76, 142-148, 187-193, 348-354, 
594-600, 640-646 
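The leucine zipper entries above can be reproduced by searching for four leucines spaced seven residues apart. The sketch below assumes the PROSITE-style pattern L-x(6)-L-x(6)-L-x(6)-L, which spans 22 residues; the figure does not name the tool used, and `ZIPPER`, `leucine_zippers` and `first_residue` are illustrative names. The fragment is taken from the FIGURE 16 sequence and is assumed to begin at residue 196, which holds if each preceding printed line carries 65 residues.

```python
import re

# Assumed PROSITE-style leucine zipper pattern: L-x(6)-L-x(6)-L-x(6)-L.
# The lookahead lets overlapping candidates be reported.
ZIPPER = re.compile(r"(?=L.{6}L.{6}L.{6}L)")


def leucine_zippers(protein, first_residue=1):
    """Return (start, end) pairs, numbered from first_residue, end inclusive."""
    protein = "".join(protein.split()).upper()  # tolerate extraction spacing
    hits = []
    for m in ZIPPER.finditer(protein):
        start = m.start() + first_residue
        hits.append((start, start + 21))  # the pattern covers 22 residues
    return hits


# Residues 196-225 of the FIGURE 16 sequence (assumed numbering, see above).
fragment = "ALRLAGLGLQQLDEGLFSRLRNLHDLDVSD"
print(leucine_zippers(fragment, first_residue=196))  # [(197, 218)]
```

The hit begins at residue 197, matching the first leucine zipper entry. The figure lists the range as 197-219, one residue longer than the 22-residue pattern, so the end coordinate printed here differs by one from the figure's convention.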
FIGURE 17

GCAGCGGCGAGGCGGCGGTGGTGGCTGAGTCCGTGGTGGCAGAGGCGAAGGCGACAGCTCATCCG 
GGTCCGGATAGGGCTGACGCTGCTGCTGTGTGCGGTGCTGCTGAGCTTGGCCTCGGCGTCCTCGG 
AT GAAGAAGGCAGCCAGGATGAAT CCT TAGAT TCCAAGAC TACT T T GACATCAGATGAGTCAG TA 
AAGGACCATAC TAC T GCAGGCAGAG TAGT TGC TGG T CAAATAT T T CT TGAT TCAGAAGAATC TGA 
ATTAGAATCCTCTATTCAAGAAGAGGAAGACAGCCTCAAGAGCCAAGAGGGGGAAAGTGTCACAG 
AAGATAT CAGCT T T C TAGAGTC TCCAAATCCAGAAAACAAGGACTAT GAAGAGCCAAAGAAAGTA 
CGGAAACCAGCTTTGACCGCCATTGAAGGCACAGCACATGGGGAGCCCTGCCACTTCCCTTTTCT 
T T TCCTAGATAAGGAGTATGAT GAATGTACAT CAGATGGGAGGGAAGATGGCAGACT GTGGTGT G 
CTACAACCTATGACTACAAAGCAGATGAAAAGTGGGGCTTTTGTGAAACTGAAGAAGAGGCTGCT 
AAGAGACGGCAGATGCAGGAAGCAGAAAT GATGTATCAAAC T GGAAT GAAAAT CCT TAATGGAAG 
CAATAAGAAAAGC CAAAAAAGAGAAGCATAT CGGTATC TCCAAAAGGCAGCAAGCATGAACCATA 
CCAAAGCCC T GGAGAGAG TGTCATATGCTCT T T TATT TGGTGAT TAC TT GCCACAGAATATCCAG 
GCAGCGAGAGAGAT G TT T GAGAAGC TGAC TGAGGAAGGC TCT CCCAAGGGACAGAC TGCTCTTGG 
CTTTCTGTATGCCTCTGGACTTGGTGTTAATTCAAGTCAGGCAAAGGCTCTTGTATATTATACAT 
TTGGAGCTCTTGGGGGCAATCT7VATAGCCCACATGGTTTTGGTAAGTAGACTT TAG TGGAAGGCT 
AATAATAT TAACAT CAGAAGAAT T TGTGGT T TATAGCGGCCACAAC T TT T TCAGCT T TCATGATC 
CAGATT TGCT TGTATTAAGACCAAATATTCAGTTGAAC TTCCTTCAAATTCT TGTTAATGGATAT 
AACACAT GGAATCTACAT GTAAAT GAAAGT T GGT GGAGTCCACAAT T TT TCT T TAAAATGATTAG 
TTTGGCTGATTGCCCCTAAAAAGAGAGATCTGATAAATGGCTCTTTTTAAATTTTCTCTGAGTTG 
GAATTGT CAGAAT CAT T T T T TACAT TAGAT TATCATAAT T TTAAAAAT T TT TC T T TAGTT T TT CA 
AAATT TTGTAAATGGTGGCTATAGAAAAACAACAT GAAATAT TATACAATATTTTGCAACAATGC 
CCTAAGAATTGTTAAAATTCATGGAGTTATTTGTGCAGAATGACTCCAGAGAGCTCTACTTTCTG 
TTTTTTACTTTTCATGATTGGCTGTCTTCCCATTTATTCTGGTCATTTATTGCTAGTGACACTGT 
GCCTGCTTCCAGTAGTCTCATTTTCCCTATTTTGCTAATTTGTTACTTTTTCTTTGCTAATTTGG 
AAGAT TAACT CAT T TT TAATAAAAT TAT GTC T AAGAT TAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
FIGURE 18



MRVRIGLTLLLCAVLLSLASASSDEEGSQDESLDSKTTLTSDESVKDHTTAGRVVAGQIFLDSEESEL
ESSIQEEEDSLKSQEGESVTEDISFLESPNPENKDYEEPKKVRKPALTAIEGTAHGEPCHFPFLFLDK 
EYDECTSDGREDGRLWCATTYDYKADEKWGFCETEEEAAKRRQMQEAEMMYQTGMKILNGSNKKSQKR 
EAYRYLQKAASMNHTKALERVSYALLFGDYLPQNIQAAREMFEKLTEEGSPKGQTALGFLYASGLGVN 
SSQAKALVYYTFGALGGNLIAHMVLVSRL

Important features: 
Signal peptide: 

amino acids 1-21 

N-glycosylation sites. 

amino acids 195-199, 217-221, 272-276 

Tyrosine kinase phosphorylation site.

amino acids 220-228

N-myristoylation sites.

amino acids 120-126, 253-259, 268-274, 270-274, 285-291, 289-295

Glycosaminoglycan attachment site. 

amino acids 267-271

Microbodies C-terminal targeting signal. 

amino acids 299-303 

Type II fibronectin collagen-binding domain protein.

amino acids 127-169 

Fructose-bisphosphate aldolase class-II protein. 

amino acids 101-119 
FIGURE 19



AATT CAGAT TT TAAGCCCAT TCTGCAGTGGAATT TCATGAACTAGCAAGAGGACACCATCT TC T T 
GTAT TATACAAGAAAGGAGTGTACCTATCACACACAGGGGGAAAAATGC TCTTTTGGGTGCTAGG 
CCTCCTAATCCTCTGTGGTTTTCTGTGGACTCGTAAAGGAAAACT71AAGATTGAAGACATCACTG 
ATAAGTACATTTTTATCACTGGATGTGACTCGGGCTTTGGAAACTTGGCAGCCAGAACTTTTGAT 
AAAAAGGGATT TCATGTAATCGCTGCCTGTCTGACT GAATCAGGAT CAACAGCT TTAAAGGCAGA 
AACCTCAGAGAGACTTCGTACTGTGCTTCTGGATGTGACCGACCCAGAGAATGTCAAGAGGACTG 
CCCAGTGGGTGAAGAACCAAGTTGGGGAGAAAGGTCTCTGGGGTCTGATCAATAATGCTGGTGTT 
CCC GGCG TGC TGGC TCCCAC T GACT GGC T GACACT AGAGGAC TACAGAGAAC CTAT T GAAGTGAA 
CCTGTTTGGACTCATCAGTGTGACACTAAATATGCTTCCTTTGGTCAAGAAAGCTCAAGGGAGAG 
TTATTAATGTCTCCAGTGTTGGAGGTCGCCTTGCAATCGTTGGAGGGGGCTATACTCCATCCAAA 
TATGCAGTGGAAGGTTTCAATGACAGCTTAAGACGGGACATGAAAGCTTTTGGTGTGCACGTCTC 
AT GCAT TGAACCAGGAT T GT TCAAAACAAAC TT GGCAGATCCAGTAAAGG TAAT TGAAAAAAAAC 
TCGCCATTTGGGAGCAGCTGTCTCCAGACATCAAACAACAATATGGAGAAGGTTACATTGAAAAA 
AGTCTAGACAAAC TGAAAGGCAATAAATCCTAT GT GAACAT GGACCT CT C TCCGGTGGTAGAG T G 
CATGGACCACGCT C TAACAAGT CT C T TCCCTAAGACTCAT TATGCCGCT GGAAAAGATGCCAAAA 
TTTTCTGGATACCTCTGTCTCACATGCCAGCAGCTTTGCAAGACTTTTTATTGTTGAAACAGAAA 
GCAGAGC TGGCTAAT CCCAAGGCAGTG TGAC T CAGC TAACCACAAAT GT C TCC TCCAGGC TAT GA 
AATTGGCCGATTTCAAGAACACATCTCCTTTTCAACCCCATTCCTTATCTGCTCCAACCTGGACT 
CATTTAGATCGTGCTTATTTGGATTGCAAAAGGGAGTCCCACCATCGCTGGTGGTATCCCAGGGT 
CCCTGCTCAAGTTTTCTTTGAAAAGGAGGGCTGGAATGGTACATCACATAGGCAAGTCCTGCCCT 
GTATTTAGGCTTTGCCTGCTTGGTGTGATGTAAGGGAAATTGAAAGACTTGCCCATTCAAAATGA 
TCTTTACCGTGGCCTGCCCCATGCTTATGGTCCCCAGCATTTACAGTAACTTGTGAATGTTAAGT 
AT CATC TCT TAT CT AAATAT TAAAAGATAAGTCAACCCAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAA 
FIGURE 20

MLFWVLGLLILCGFLWTRKGKLKIEDITDKYIFITGCDSGFGNLAARTFDKKGFHVIAACLTESG
STALKAETSERLRTVLLDVTDPENVKRTAQWVKNQVGEKGLWGLINNAGVPGVLAPTDWLTLEDY
REPIEVNLFGLISVTLNMLPLVKKAQGRVINVSSVGGRLAIVGGGYTPSKYAVEGFNDSLRRDMK
AFGVHVSCIEPGLFKTNLADPVKVIEKKLAIWEQLSPDIKQQYGEGYIEKSLDKLKGNKSYVNMD 
LSPVVECMDHALTSLFPKTHYAAGKDAKIFWIPLSHMPAALQDFLLLKQKAELANPKAV

Important features of the protein: 
Signal peptide: 

amino acids 1-17 

Transmembrane domain: 

amino acids 136-152 

N-glycosylation sites. 

amino acids 161-163, 187-190 and 253-256 

Glycosaminoglycan attachment site. 

amino acids 39-42 

N-myristoylation sites. 

amino acids 36-41, 42-47, 108-113, 166-171, 198-203 and 207-212 
FIGURE 21



CTGAGGCGGCGGTAGCATGGAGGGGGAGAGTACGTCGGCGGTGCTCTCGGGCTTTGTGCTCGGCG 
CACTCGCTTTCCAGCACCTCAACACGGACTCGGACACGGAAGGTTTTCTTCTTGGGGAAGTAAAA 
GG T GAAGCCAAGAACAGCAT TAC T GAT TCCCAAAT GGAT GATGTT GAAGT TGT T T ATACAAT TGA 
CAT TCAGAAATATATTCCATGCTATCAGCTT TTTAGCTT TTATAATTCT TCAGGCGAAGTAAAT G 
AGCAAGCAC TGAAGAAAATAT TAT CAAAT GTCAAAAAGAAT GT GGTAGGT TGGTACAAAT T CCG T 
CGT CAT T CAGATCAGATCATGACG T T TAGAGAGAGGC TGCT TCACAAAAACT TGCAGGAGCAT T T 
T T CAAACCAAGACC TTG T TT TTCT GCTAT TAACACCAAGTATAAT AACAGAAAGCT GC T C TACTC 
AT C GACTGGAACAT TCC T TATATAAACC T CAAAAAGGAC T T T T TCACAGGGTACCT TTAGT GGT T 
GCCAATCTGGGCATGTCTGAACAACTGGGTTATAAAACTGTATCAGGTTCCTGTATGTCCACTGG 
TT T TAGCCGAGCAGTACAAACACACAGC TCTAAAT T T TT TGAAGAAGATGGAT CCT TAAAGGAGG 
TACATAAGATAAAT GAAATGTAT GCT TCATTACAAGAGGAAT TAAAGAG TATAT GCAAAAAAGT G 
GAAGACAGTGAACAAGCAGTAGATAAACTAGTAAAGGATGTAAACAGATTAAAACGAGAAATTGA 
GAAAAGGAGAGGAGCACAGATTCAGGCAGCAAGAGAGAAGAACATCCAAAAAGACCCTCAGGAGA 
ACATTTTTCTTTGTCAGGCATTACGGACCTTTTTTCCAAATTCTGAATTTCTTCATTCATGTGTT 
AT GT CT T TAAAAAATAGACAT GT T T CTAAAAGTAGCTGTAAC TACAACCACCATCTCGATGTAGT 
AGACAATCTGACCT TAATGGTAGAACACACTGACAT TCC TGAAGCTAGTCCAGCTAGTACACCAC 
AAAT CATTAAGCATAAAGCCTTAGACT TAGATGACAGATGGCAATTCAAGAGATCTCGGTTGT TA 
GATACACAAGACAAACGATCTAAAGCAAATACTGGTAGTAGTAACCAAGATAAAGCATCCAAAAT 
GAGCAGCCCAGAAACAGATGAAGAAAT TGAAAAGAT GAAGGGT T T T GGTGAATAT TCACGGTC T C 
CT ACAT T T TGA T CC TT T TAAC CT TACAAGGAGAT T T T T T TAT T T GGCT GATGGGTAAAGCCAAAC 
ATTTCTATTGTTTTTACTATGTTGAGCTACTTGCAGTAAGTTCATTTGTTTTTACTATGTTCACC 
T G T T T GCAGTAAT ACACAGATAACT CT TAGT GCAT T TACT TCACAAAG TACT T T T T CAAACATCA 
GATGCTTTTATTTCCAAACCTTTTTTTCACCTTTCACTAAGTTGTTGAGGGGAAGGCTTACACAG 
ACACATTCT TTAGAAT TGGAAAAGTGAGACCAGGCACAGTGGCTCACACC TGTAAT CCCAGCACT 
TAGGGAAGACAAGT CAGGAGGATTGATTGAAGCTAGGAGTTAGAGACCAGCCTGGGCAACGTAT T 
GAGACCATGTCT AT TAAAAAATAAAATGGAAAAGCAAGAATAGCC T TAT TTTCAAAATATGGAAA 
GAAAT T TAT ATGAAAAT T TATCT GAGT CAT TAAAATT CT CC T T AAGT GATAC T T T T TT AGAAGTA 
CAT TATGGCTAGAGTTGCCAGATAAAATGC TGGATATCATGCAATAAAT TTGCAAAACATCATCT 
AAAATTTAAAAAAAAAAAAAAAAAAAAAA 
FIGURE 22






MEGESTSAVLSGFVLGALAFQHLNTDSDTEGFLLGEVKGEAKNSITDSQMDDVEVVYTIDIQKYI
PCYQLFSFYNSSGEVNEQALKKILSNVKKNVVGWYKFRRHSDQIMTFRERLLHKNLQEHFSNQDL
VFLLLTPSIITESCSTHRLEHSLYKPQKGLFHRVPLVVANLGMSEQLGYKTVSGSCMSTGFSRAV
QTHSSKFFEEDGSLKEVHKINEMYASLQEELKSICKKVEDSEQAVDKLVKDVNRLKREIEKRRGA 
QIQAAREKNIQKDPQENIFLCQALRTFFPNSEFLHSCVMSLKNRHVSKSSCNYNHHLDVVDNLTL
MVEHTDIPEASPASTPQIIKHKALDLDDRWQFKRSRLLDTQDKRSKANTGSSNQDKASKMSSPET 
DEE I EKMKGFGE YSRS PT F 

Important features: 
Signal peptide: 

amino acids 1-19 

N-glycosylation sites.

amino acids 75-79, 322-326 



N-myristoylation site. 

amino acids 184-190



Growth factor and cytokines receptors family. 

amino acids 134-150 
FIGURE 23



GGCACAGCC GCGC GGCG GAGGGCAGAGT CAGCC GAGC CGAGTCCAGCC GGAC GAGCGGACCAGC GCAGGGCAG C CCAA 
GCAGCGCGCAGCGAACGCCCGCCGCCGCCCACACCCTCTGCGGTCCCCGCGGCGCCTGCCACCCTTCCCTCCTTCCCC 
GCGTCCCCGCCTCGCCGGCCAGTCAGCTTGCCGGGTTCGCTGCCCCGCGAAACCCCGAGGTCACCAGCCCGCGCCTCT 
GCTTCCCTGGGCCGCGCGCCGCCTCCACGCCCTCCTTCTCCCCTGGCCCGGCGCCTGGCACCGGGGACCGTTGCCTGA 
CGCGAGGCCCAGCTCTACTTTTCGCCCCGCGTCTCCTCCGCCTGCTCGCCTCTTCCACCAACTCCAACTCCTTCTCCC 
TCCAGCTCCACTCGCTAGTCCCCGACTCCGCCAGCCCTCGGCCCGCTGCCGTAGCGCCGCTTCCCGTCCGGTCCCAAA 
GGTGGGAACGCGTCCGCCCCGGCCCGCACC ATG GCACGGTTCGGCTTGCCCGCGCTTCTCTGCACCCTGGCAGTGCTC 
AGCGCCGCGCTGCTGGCTGCCGAGCTCAAGTCGAAAAGTTGCTCGGAAGTGCGACGTCTTTACGTGTCCAAAGGCTTC 
AACAAGAACGAT GC CC CCCT CCAC GAGAT CAACGGTGAT CATT T GAAGAT CT GT CCCCAGGGTT CTACCT GCT GCTCT 
CAAGAGATGGAGGAGAAGTACAGCCTGC^^GTAAAGATGATTTCAAAAGTGTGGTCAGCGAACAGTGC^\ATCATTTG 
CAAGCTGTCTTT GCTTGACGT TACAAGAAGTT T GAT GAAT T CT T CAAAGAACTACTT GAAAAT G CAGAGAAATC C CT G 
AAT GAT ATGT TT GT GAAGACATATGGCCATTTATACATGCAAAATTCT GAGCTAT TTAAAGAT CTCTT CGTAGAGTTG 
AAAC GTTACTAC GT GGT GGGAAATGTGAACCT GGAAGAAATGCTAAATGACT T CT GGGCT CGCCTCCT GGAGC GGAT G 
TT CC GCCTGGT GAACT C CCAGTACCACTT TACAGATGAGTAT CT GGAAT GT GTGAGCAAGTATAC GGAGCAGCT GAAG 
CCCTTCGGAGATGTCCCTCGCAAATTGAAGCTCCAGGTTACTCGTGCTTTTGTAGCAGCCCGTACTTTCGCTCAAGGC 
TT AGCGGTT GCGGGAGATGT C GT GAGCAAGGT CT CCGTGGTAAAC CCCACAGCC CAGT GTAC C C ATGCCCTGTT GAAG 
ATGATCTACTGCTCCCACTGCCGGGGTCTCGTGACTGTGAAGCCATGTTACAACTACTGCTCAAACATCATGAGAGGC 
TGTTT GGCCAACCAAGGGGAT CTCGAT TTT GAAT GGAACAATT T CATAGAT G CTAT GCT GAT GGT GGCAGAGAGG CTA 
GAGGGTCCTTTCAACATTGAATCGGTCATGGATCCCATCGATGTGAAGATTTCTGATGCTATTATGAACATGCAGGAT 
AATAGTGTTCAAGTGTCTCAGAAGGTTTTCCAGGGATGTGGACCCCCCAAGCCCCTCCCAGCTGGACGAATTTCTCGT 
T C CAT C T CTGAAAGT GCCTT CAGT GCT CGCT T CAGAC CACAT CACC CCGAGGAACGCCCAACCACAG CAGCT GGCAC T 
AGTTT GGAC CGACT GGT TACT GAT GT CAAG GAGAAACTGAAACAGGC CAAGAAAT T CTGGT CCT CCCTTCCGAGCAAC 
GTTT GCAAC GAT GAGAGGATGGCT GCAGGAAACGGCAAT GAG GAT GACT GT T GGAAT GGGAAAGGCAAAAGCAGGTAC 
CT GTT TGCAGT GACAGGAAAT GGAT TAGCCAACCAGGGCAACAAC CCAGAGGT CCAGGT T GACAC CAGCAAAC CAGAC 
ATACT GATC CTT CGTCAAAT CAT G GCT CT T CGAGT GAT GACCAGCAAGAT GAAGAAT G CATACAAT GGGAACGACGT G 
GACTTCTTTGATATCAGTGATGAAAGTAGTGGAGAAGGAAGTGGAAGTGGCTGTGAGTATCAGCAGTGCCCTTCAGAG 
TTTGACTACAATGCCACTGACCATGCTGGGAAGAGTGCCAATGAGAAAGCCGACAGTGCTGGTGTCCGTCCTGGGGCA 
CAGGCCTAC CT CCT CACTGT CTT CT GCAT CTT GT T CC TGGT TAT GCAGAGAGAGT GGAGATAAT TC T CAAACTCT GAG 
AAAAAGTGTTCATCAAAAAGTT7VAAAGGCACCAGTTATCACTTTTCTACCATCCTAGTGACTTTGCTTTTTAAATGAA 
T GGACAACAAT GTACAGTTTTT ACTAT GT GGCCACT GGT TTAAGAAGTGCT GACT TT GTT TT CT CAT T CAGT TTT GGG 
AGGAAAAGGGACTGT GCATTGAGT T G GTT C CT GC TCC CCCAAACCAT GTTAAAC GT GGCTAACAGT GTAG GTACAGAA 
CTATAGTTAGTTGTGCATTTGTGATTTTATCACTCTATTATTTGTTTGTATGTTTTTTTCTCATTTCGTTTGTGGGTT 
TTTTTTTCCAACTGTGATCTCGCCTTGTTTCTTACAAGCAAACCAGGGTCCCTTCTTGGCACGTAACATGTACGTATT 
T CT GAAATAT TAAATAGCT GTACAGAAGCAGGTT TTATTTAT CAT GTTAT CTTAT TAAAAGAAAAAG CCCAAAAAGC 
FIGURE 24



MARFGLPALLCTLAVLSAALLAAELKSKSCSEVRRLYVSKGFNKNDAPLHEINGDHLKICPQGST
CCSQEMEEKYSLQSKDDFKSVVSEQCNHLQAVFASRYKKFDEFFKELLENAEKSLNDMFVKTYGH
LYMQNSELFKDLFVELKRYYVVGNVNLEEMLNDFWARLLERMFRLVNSQYHFTDEYLECVSKYTE
QLKPFGDVPRKLKLQVTRAFVAARTFAQGLAVAGDVVSKVSVVNPTAQCTHALLKMIYCSHCRGL
VTVKPCYNYCSNIMRGCLANQGDLDFEWNNFIDAMLMVAERLEGPFNIESVMDPIDVKISDAIMN
MQDNSVQVSQKVFQGCGPPKPLPAGRISRSISESAFSARFRPHHPEERPTTAAGTSLDRLVTDVK
EKLKQAKKFWSSLPSNVCNDERMAAGNGNEDDCWNGKGKSRYLFAVTGNGLANQGNNPEVQVDTS
KPDILILRQIMALRVMTSKMKNAYNGNDVDFFDISDESSGEGSGSGCEYQQCPSEFDYNATDHAG
KSANEKADSAGVRPGAQAYLLTVFCILFLVMQREWR



Important features:
Signal peptide:

amino acids 1-22

ATP/GTP-binding site motif A (P-loop)

amino acids 515-524

N-glycosylation site.

amino acids 514-518

Glycosaminoglycan attachment sites.

amino acids 494-498, 498-502



N-myristoylation sites. 

amino acids 63-69, 224-230, 276-282, 438-444, 497-503, 531-537 
Glypicans proteins. 

amino acids 54-75, 105-157, 238-280, 309-346, 423-460, 468-506 
FIGURE 25



CTCGCCCTCAAATGGGAACGCTGGCCTGGGACTAAAGCATAGACCACCAGGCTGAGTATCCTGAC 
CT GAGT CAT CCCCAGGGATCAGGAGCC T CCAGCAGGGAACC T TCCATTATAT TCT T CAAGCAAC T 
TACAGCTGCACCGACAGTTGC GATGA AAGTTCTAATCTCTTCCCTCCTCCTGTTGCTGCCACTAA 
TGCTGATGTCCATGGTCTCTAGCAGCCTGAATCCAGGGGTCGCCAGAGGCCACAGGGACCGAGGC 
CAGGCTTCTAGGAGATGGCTCCAGGAAGGCGGCCAAGAATGTGAGTGCAAAGATTGGTTCCTGAG 
AGCCCCGAGAAGAAAATTCATGACAGTGTCTGGGCTGCCAAAGAAGCAGTGCCCCTGTGATCATT
TCAAGGGCAATGTGAAGAAAACAAGACACCAAAGGCACCACAGAAAGCCAAACAAGCATTCCAGA 
GCCTGCCAGCAATTTCTCAAACAATGTCAGCTAAGAAGCTTTGCTCTGCCTTT GTAG GAGCTCTG 
AGCGCCCACTCTT CCAATTAAACATTCTCAGCCAAGAAGACAGTGAGCACACCTACCAGACAC TC 
TTCTTCTCCCACCTCACTCTCCCACTGTACCCACCCCTAAATCATTCCAGTGCTCTCAAAAAGCA 
TGTTTTTCAAGATCATTTTGTTTGTTGCTCTCTCTAGTGTCTTCTTCTCTCGTCAGTCTTAGCCT 
GTGCCCTCCCCTTACCCAGGCTTAGGCTTAATTACCTGAAAGATTCCAGGAAACTGTAGCTTCCT 
AGCTAGTGTCATTTAACCTTAAATGCAATCAGGAAAGTAGCAAACAGAAGTCAATAAATATTTTT 
AAATGTCAAAAAAAAAAAAAAAAAA 
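
The relationship between the nucleotide sequence of FIGURE 25 and the amino acid sequence of FIGURE 26 can be checked by conceptual translation. The following is a minimal sketch, assuming the standard genetic code and assuming the open reading frame begins at the set-off ATG in the third line of FIGURE 25; the names translate and FIG25_ORF_START are illustrative and do not come from the figures.

```python
# Standard genetic code with codons enumerated in T, C, A, G order
# (assumption: the standard code applies to this sequence).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate from the first base, stopping at the first stop codon.

    Whitespace (for example OCR spacing) is ignored; codons containing
    anything other than A, C, G, T are reported as 'X'.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE.get(dna[pos:pos + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 57 bases of the reading frame, copied from FIGURE 25 with spaces removed.
FIG25_ORF_START = "ATGAAAGTTCTAATCTCTTCCCTCCTCCTGTTGCTGCCACTAATGCTGATGTCCATG"

print(translate(FIG25_ORF_START))  # MKVLISSLLLLLPLMLMSM
```

Under these assumptions the output corresponds to the first 19 residues of FIGURE 26, which is one way to resolve characters that are illegible in the printed amino acid sequence.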
FIGURE 26



MKVLISSLLLLLPLMLMSMVSSSLNPGVARGHRDRGQASRRWLQEGGQECECKDWFLRAPRRKFM
TVSGLPKKQCPCDHFKGNVKKTRHQRHHRKPNKHSRACQQFLKQCQLRSFALPL 

Important features:
Signal peptide: 

amino acids 1-22 

N-myristoylation sites. 

amino acids 27-33, 46-52
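
The N-myristoylation positions listed for FIGURE 26 can be cross-checked against the sequence with a simple motif scan. The sketch below assumes the PROSITE-style consensus G-{EDRKHPFYW}-x(2)-[STAGCN]-{P}, which the figure itself does not state, and reads the illegible run near residues 14-17 as LMLM on the basis of the FIGURE 25 coding sequence. The names MYRISTOYL, motif_starts and FIG26 are illustrative.

```python
import re

# Assumed consensus for an N-myristoylation site, written as a lookahead
# so that overlapping candidate sites are all reported.
MYRISTOYL = re.compile(r"(?=G[^EDRKHPFYW]..[STAGCN][^P])")


def motif_starts(seq, pattern=MYRISTOYL):
    """Return the 1-based start position of every match of pattern in seq."""
    return [m.start() + 1 for m in pattern.finditer(seq)]


# FIGURE 26 sequence (residues 14-17 taken as LMLM, an assumption).
FIG26 = (
    "MKVLISSLLLLLPLMLMSMVSSSLNPGVARGHRDRGQASRRWLQEGGQECECKDWFLRAPRRKFM"
    "TVSGLPKKQCPCDHFKGNVKKTRHQRHHRKPNKHSRACQQFLKQCQLRSFALPL"
)

print(motif_starts(FIG26))  # [27, 46]
```

The two start positions agree with the annotated ranges 27-33 and 46-52. The figures appear to report each range as start through start plus six, whereas the assumed pattern itself spans six residues.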
FIGURE 27



GGACGCCAGCGCCTGCAGAGGCTGAGCAGGGAAAAAGCCAGTGCCCCAGCGGAAGCACAGCTCAG 
AGCTGGTCTGCCATGGACATCCTGGTCCCACTCCTGCAGCTGCTGGTGCTGCTTCTTACCCTGCC 
CCTGCACCTCATGGCTCTGCTGGGCTGCTGGCAGCCCCTGTGCAAAAGCTACTTCCCCTACCTGA 
TGGCCGTGCTGACTCCCAAGAGCAACCGCAAGATGGAGAGCAAGAAACGGGAGCTCTTCAGCCAG 
ATAAAGGGGCTTACAGGAGCCTCCGGGAAAGTGGCCCTACTGGAGCTGGGCTGCGGAACCGGAGC 
CAACTTTCAGTTCTACCCACCGGGCTGCAGGGTCACCTGCCTAGACCCAAATCCCCACTTTGAGA 
AGTTCCTGACAAAGAGCATGGCTGAGAACAGGCACCTCCAATATGAGCGGTTTGTGGTGGCTCCT
GGAGAGGACATGAGACAGCTGGCTGATGGCTCCATGGATGTGGTGGTCTGCACTCTGGTGCTGTG
CTCTGTGCAGAGCCCAAGGAAGGTCCTGCAGGAGGTCCGGAGAGTACTGAGACCGGGAGGTGTGC
TCTTTTTCTGGGAGCATGTGGCAGAACCATATGGAAGCTGGGCCTTCATGTGGCAGCAAGTTTTC
GAGCCCACCTGGAAACACATTGGGGATGGCTGCTGCCTCACCAGAGAGACCTGGAAGGATCTTGA
GAACGCCCAGTTCTCCGAAATCCAAATGGAACGACAGCCCCCTCCCTTGAAGTGGCTACCTGTTG
GGCCCCACATCATGGGAAAGGCTGTCAAACAATCTTTCCCAAGCTCCAAGGCACTCATTTGCTCC
TTCCCCAGCCTCCAATTAGAACAAGCCACCCACCAGCCTATCTATCTTCCACTGAGAGGGACCTA
GCAGAATGAGAGAAGACATTCATGTACCAGCTACTAGTCCCTCTCTCCCCAACCTCTGCCAGGGC
AATCTCTAACTTCAATCCCGCCTTCGACAGTGAAAAAGCTCTACTTCTACGCTGACCCAGGGAGG
AAACACTAGGACCCTGTTGTATCCTCAACTGCAAGTTTCTGGACTAGTCTCCCAACGTTTGCCTC
CCAATGTTGTCCCTTTCCTTCGTTCCCATGGTAAAGCTCCTCTCGCTTTCCTCCTGAGGCTACAC
CCATGCGTCTCTAGGAACTGGTCACAAAAGTCATGGTGCCTGCATCCCTGCCAAGCCCCCCTGAC 
CCTCTCTCCCCACTACCACCTTCTTCCTGAGCTGGGGGCACCAGGGAGAATCAGAGATGCTGGGG 
AT GC C AGAGC AAG AC T CAAAGAGG C AG AGG TTTTGTTCT CAAAT AT T T T T T AAT AAAT AG AC G AA 
ACCACG 
FIGURE 28



MDILVPLLQLLVLLLTLPLHLMALLGCWQPLCKSYFPYLMAVLTPKSNRKMESKKRELFSQIKGL
TGASGKVALLELGCGTGANFQFYPPGCRVTCLDPNPHFEKFLTKSMAENRHLQYERFVVAPGEDM
RQLADGSMDVVVCTLVLCSVQSPRKVLQEVRRVLRPGGVLFFWEHVAEPYGSWAFMWQQVFEPTW
KHIGDGCCLTRETWKDLENAQFSEIQMERQPPPLKWLPVGPHIMGKAVKQSFPSSKALICSFPSL 
QLEQATHQPIYLPLRGT 

Important features: 
Signal peptide: 

amino acids 1-23 




Leucine zipper pattern.



amino acids 10-32 



N-myristoylation sites. 



amino acids 64-70, 78-84, 80-86, 91-97, 201-207 
FIGURE 29



CAATGTTTGCCTATCCACCTCCCCCAAGCCCCTTYACCT ATG CTGCTGCTAACGCTGCTGCTGCT 
GCTGCTGCTGCTTAAAGGCTCATGCTTGGAGTGGGGACTGGTCGGTGCCCAGAAAGTCTCTTCTG 
CCACTGACGCCCCCATCAGGGATTGGGCCTTCTTTCCCCCTTCCTTTCTGTGTCTCCTGCCTCAT 
CGGCCTGCCATGACCTGCAGCCAAGCCCAGCCCCGTGGGGAAGGGGAGAAAGTGGGGGATGGCTA 
AGAAAGC TGGGAGAT AGGGAACAGAAGAGGGT AGT GGGT GGGC T AGGGGGGC T GCC T TAT T TAAA 
GT GGT T G TT TATGAT TCT TATACTAAT T TATACAAAGATAT TAAGGCCC TGT TCAT TAAGAAAT T 
GTTCCCTTCCCCTGTGTTCAATGTTTGTAAAGATTGTTCTGTGTAAATATGTCTTTATAATAAAC
S AGTTAAAAGCTGAAAAAAAAAAAAAAAAAAAAAAAAAAA 

FIGURE 30

MLLLTLLLLLLLLKGSCLEWGLVGAQKVSSATDAPIRDWAFFPPSFLCLLPHRPAMTCSQAQPRG 
EGEKVGDG 

Important features: 
Signal peptide: 

amino acids 1-15 

Growth factor and cytokines receptors family: 

amino acids 3-18 
FIGURE 31



GTTTGAATTCCTTCAACTATACCCACAGTCCAAAAGCAGACTCACTGTGTCCCAGGCTACCAGTT 
CCTCCAAGCAAGTCATTTCCCTTATTTAACCGATGTGTCCCTCAAACACCTGAGTGCTACTCCCT 
ATTTGCATCTGTTTTGATAAATGATGTTGACACCCTCCACCGAATTCTAAGTGGAATCATGTCGG 
GAAGAGATACAATCCTTGGCCTGTGTATCCTCGCATTAGCCTTGTCTTTGGCCATGATGTTTACC 
TTCAGATTCATCACCACCCTTCTGGTTCACATTTTCATTTCATTGGTTATTTTGGGATTGTTGTT 
TGTCTGCGGTGTTTTATGGTGGCTGTATTATGACTATACCAACGACCTCAGCATAGAATTGGACA
CAGAAAGGGAAAATATGAAGTGCGTGCTGGGGTTTGCTATCGTATCCACAGGCATCACGGCAGTG
CTGCTCGTCTTGATTTTTGTTCTCAGAAAGAGAATAAAATTGACAGTTGAGCTTTTCCAAATCAC
AAATAAAGCCATCAGCAGTGCTCCCTTCCTGCTGTTCCAGCCACTGTGGACATTTGCCATCCTCA
TTTTCTTCTGGGTCCTCTGGGTGGCTGTGCTGCTGAGCCTGGGAACTGCAGGAGCTGCCCAGGTT
ATGGAAGGCGGCCAAGTGGAATATAAGCCCCTTTCGGGCATTCGGTACATGTGGTCGTACCATTT
AATTGGCCTCATCTGGACTAGTGAATTCATCCTTGCGTGCCAGCAAATGACTATAGCTGGGGCAG
TGGTTACTTGTTATTTCAACAGAAGTAAAAATGATCCTCCTGATCATCCCATCCTTTCGTCTCTC
TCCATTCTCTTCTTCTACCATCAAGGAACCGTTGTGAAAGGGTCATTTTTAATCTCTGTGGTGAG
GATTCCGAGAATCATTGTCATGTACATGCAAAACGCACTGAAAGAACAGCAGCATGGTGCATTGT
CCAGGTACCTGTTCCGATGCTGCTACTGCTGTTTCTGGTGTCTTGACAAATACCTGCTCCATCTC
AACCAGAATGCATATACTACAACTGCTATTAATGGGACAGATTTCTGTACATCAGCAAAAGATGC
ATTCAAAATCTTGTCCAAGAACTCAAGTCACTTTACATCTATTAACTGCTTTGGAGACTTCATAA
TTTTTCTAGGAAAGGTGTTAGTGGTGTGTTTCACTGTTTTTGGAGGACTCATGGCTTTTAACTAC 
AATCGGGCATTCCAGGTGTGGGCAGTCCCTCTGTTATTGGTAGCTTTTTTTGCCTACTTAGTAGC 
CCATAGTTTTTTATCTGTGTTTGAAACTGTGCTGGATGCACTTTTCCTGTGTTTTGCTGTTGATC 
TGGAAACAAATGATGGATCGTCAGAAAAGCCCTACTTTATGGATCAAGAATTTCTGAGTTTCGTA 
AAAAGGAGCAACAAAT TAAACAAT GCAAGGGCACAGCAGGACAAGCAC T CAT TAAGGAAT GAGGA 
GGGAACAGAACT C CAGGCCATTGT GAG ATAGA TACCCAT T TAGGTATC T GTAC CT GGAAAACATT 
T CC TTC TAAGAGC CAT T TACAGAATAGAAGAT GAGACCAC TAGAGAAAAGT TAGT GAAT t TT T TT 
TTAAAAGACCTAATAAACCCTATTCTTCCTCAAAA 






FIGURE 32 

MSGRDTILGLCILALALSLAMMFTFRFITTLLVHIFISLVILGLLFVCGVLWWLYYDYTNDLSIE
LDTERENMKCVLGFAIVSTGITAVLLVLIFVLRKRIKLTVELFQITNKAISSAPFLLFQPLWTFA
ILIFFWVLWVAVLLSLGTAGAAQVMEGGQVEYKPLSGIRYMWSYHLIGLIWTSEFILACQQMTIA
GAVVTCYFNRSKNDPPDHPILSSLSILFFYHQGTVVKGSFLISVVRIPRIIVMYMQNALKEQQHG
ALSRYLFRCCYCCFWCLDKYLLHLNQNAYTTTAINGTDFCTSAKDAFKILSKNSSHFTSINCFGD
FIIFLGKVLVVCFTVFGGLMAFNYNRAFQVWAVPLLLVAFFAYLVAHSFLSVFETVLDALFLCFA
VDLETNDGSSEKPYFMDQEFLSFVKRSNKLNNARAQQDKHSLRNEEGTELQAIVR 

Important features: 
Signal peptide:

amino acids 1-20 

Putative transmembrane domains: 

amino acids 35-54, 75-97, 126-146, 185-204, 333-350, 352-371 

N-glycosylation sites. 

amino acids 204-208, 295-299, 313-317 

N-myristoylation sites. 

amino acids 147-153, 178-184, 196-202, 296-302, 342-348
FIGURE 33



GTTCGATTAGCTCCTCTGAGAAGAAGAGAAAAGGTTCTTGGACCTCTCCCTGTTTCTTCCTTAGA 
ATAAT T TGTAT GGGAT T T GT GAT GCAGGAAAGCCTAAGGGAAAAAGAATAT T CAT T CTG TGTGG T 
GAAAAT T T T T T GAAAAAAAAAT T GC CT TC T TCAAACAAGGGT GTCATT C T GAT AT T TATGAGGAC 
TGTTGTTCTCACTATGAAGGCATCTGTTATTGAAATGTTCCTTGTTTTGCTGGTGACTGGAGTAC 
AT T CAAACAAAGAAACGGCAAAGAAGAT TAAAAGGCCCAAGT T CAC TGT GCC TCAGATCAACTGC 
GAT GT CAAAGCCGGAAAGAT CAT CGATCC TGAGT TCAT T GTGAAAT G T CCAGCAGGATGCCAAGA 
CCCCAAATACCATGTTTATGGCACTGACGTGTATGCATCCTACTCCAGTGTGTGTGGCGCTGCCG 
TACACAGTGGT GT GCT TGATAAT T CAGGAGGGAAAATACT TG T TCGGAAGG T TGCTGGACAGT CT 
GGTTACAAAGGGAGTTATTCCAACGGTGTCCAATCGTTATCCCTACCACGATGGAGAGAATCCTT 
TATCGTCTTAGAAAGTAAACCCAAAAAGGGTGTAACCTACCCATCAGCTCTTACATACTCATCAT 
CGAAAAGTCCAGCTGCCCAAGCAGGTGAGACCACAAAAGCCTATCAGAGGCCACCTATTCCAGGG 
ACAACTGCACAGCCGGTCACTCTGATGCAGCTTCTGGCTGTCACTGTAGCTGTGGCCACCCCCAC 
CACCTTGCCAAGGCCATCCCCTTCTGCTGCTTCTACCACCAGCATCCCCAGACCACAATCAGTGG 
GC CACAGGAGCCAGGAGATGGAT CT CT GGTC CAC TGCCACC TACACAAGCAGC CAAAACAGGCCC 
AGAGCT GATCCAGG TAT CCAAAGGCAAGATC CTT CAGGAGC T GCC T T CCAGAAACCT GT TGGAGC 
GGATGTCAGCCTGGGACTTGTTCCAAAAGAAGAATTGAGCACACAGTCTTTGGAGCCAGTATCCC 
T GGGAGATCCAAAC TGCAAAATT GACT TGTCGTTTT TAAT TGAT GGGAGCAC CAGCAT T GGCAAA 
CGGCGATTCCGAATCCAGAAGCAGCTCCTGGCTGATGTTGCCCAAGCTCTTGACATTGGCCCTGC 
CGGTCCAC TGAT GGGTGT TGTCCAG TAT GGAGACAACCC TGC TAC T CAC T TTAACCT CAAGACAC 
ACACGAAT TC T CGAGAT CTGAAGACAGCCATAGAGAAAAT TAC TCAGAGAGGAGGACT T TC TAAT 
GTAGGTCGGGCCATCTCCTTTGTGACCAAGAACTTCTTTTCCAAAGCCAATGGAAACAGAAGCGG 
GGCTCCCAATGTGGTGGTGGTGATGGTGGATGGCTGGCCCACGGACAAAGTGGAGGAGGCTTCAA 
GACT TGCGAGAGAG TCAGGAATCAACAT T TT CTT CATCACCAT TGAAGGTGC T GCT GAAAATGAG 
AAGCAGTATGTGGTGGAGCCCAACTTTGCAAACAAGGCCGTGTGCAGAACAAACGGCTTCTACTC 
GCTCCACGTGCAGAGCTGGTTTGGCCTCCACAAGACCCTGCAGCCTCTGGTGAAGCGGGTCTGCG 
ACACTGACCGCCTGGCCTGCAGCAAGACCTGCTTGAACTCGGCTGACATTGGCTTCGTCATCGAC 
GGCTCCAGCAGTGTGGGGACGGGCAACTTCCGCACCGTCCTCCAGTTTGTGACCAACCTCACCAA 
AGAGTTTGAGATTTCCGACACGGACACGCGCATCGGGGCCGTGCAGTACACCTACGAACAGCGGC 
TGGAGTTTGGGTTCGACAAGTACAGCAGCAAGCCTGACATCCTCAACGCCATCAAGAGGGTGGGC 
TACTGGAGTGGTGGCACCAGCACGGGGGCTGCCATCAACTTCGCCCTGGAGCAGCTCTTCAAGAA 
GT CCAAGCCCAACAAGAGGAAGT TAATGATCCTCATCACCGACGGGAGGTCC TACGACGACGTCC 
GGATCCCAGCCATGGCTGCCCATCTGAAGGGAGTGATCACCTATGCGATAGGCGTTGCCTGGGCT 
GC CCAAGAGGAGC T AGAAGT CAT T GCCAC TCACCCC GCCAGAGACCAC TCCTTCTT T GTGGACGA 
GT T TGACAAC C TCCATCAGTATGT CCCCAGGATCAT CCAGAACAT T TGTACAGAGT T CAAC TCAC 
AGCCTCGGAACTGAATTCAGAGCAGGCAGAGCACCAGCAAGTGCTGCTTTACTAACTGACGTGTT 
GGACCACCCCACCGCTTAATGGGGCACGCACGGTGCATCAAGTCTTGGGCAGGGCATGGAGAAAC 
AAATGTCTTGTTATTATTCTTTGCCATCATGCTTTTTCATATTCCAAAACTTGGAGTTACAAAGA 
TGAT CACAAACGTATAGAAT GAGCCAAAAGGC TACATCATGT T GAGGGT GC TGGAGAT T T TACAT 
TT T GACAAT TGT T T TCAAAATAAAT GT TCGGAAT ACAGT GCAGCCC T TACGACAGGCT TACGTAG 
AGCTTTTGTGAGATTTTTAAGTTGTTATTTCTGATTTGAACTCTGT7UVCCCTCAGCAAGTTTCAT 
T T T T G T CAT GACAAT G TAGGAAT T G C T GAAT T AAAT GT T T AGAAG GAT GAAAAAT AAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAG 
FIGURE 34

MRTVVLTMKASVIEMFLVLLVTGVHSNKETAKKIKRPKFTVPQINCDVKAGKIIDPEFIVKCPAG 
CQDPKYHVYGTDVYASYSSVCGAAVHSGVLDNSGGKILVRKVAGQSGYKGSYSNGVQSLSLPRWR 
ESFIVLESKPKKGVTYPSALTYSSSKSPAAQAGETTKAYQRPPIPGTTAQPVTLMQLLAVTVAVA 
TPTTLPRPSPSAASTTSIPRPQSVGHRSQEMDLWSTATYTSSQNRPRADPGIQRQDPSGAAFQKP
VGADVSLGLVPKEELSTQSLEPVSLGDPNCKIDLSFLIDGSTSIGKRRFRIQKQLLADVAQALDI
GPAGPLMGVVQYGDNPATHFNLKTHTNSRDLKTAIEKITQRGGLSNVGRAISFVTKNFFSKANGN
RSGAPNVVVVMVDGWPTDKVEEASRLARESGINIFFITIEGAAENEKQYVVEPNFANKAVCRTNG
FYSLHVQSWFGLHKTLQPLVKRVCDTDRLACSKTCLNSADIGFVIDGSSSVGTGNFRTVLQFVTN
LTKEFEISDTDTRIGAVQYTYEQRLEFGFDKYSSKPDILNAIKRVGYWSGGTSTGAAINFALEQL
FKKSKPNKRKLMILITDGRSYDDVRIPAMAAHLKGVITYAIGVAWAAQEELEVIATHPARDHSFF
VDEFDNLHQYVPRIIQNICTEFNSQPRN

Important features: 
Signal peptide: 

amino acids 1-26

Transmembrane domain: 

amino acids 181-200 

N-glycosylation sites.

amino acids 390-394, 520-524 

N-myristoylation sites. 

amino acids 23-29, 93-99, 115-121, 262-268, 367-373, 389-395, 
431-437, 466-472, 509-515, 570-576, 571-577, 575-581, 627-633 

Amidation site. 

amino acids 304-308 
FIGURE 35



CC GAGC AC AGGAGATT GCCT GCGTTT AGGAGGTGGCTGCGTTGT GGGAAAAGCT AT C AAGGAAG AAAT TGC 
CAAACC AT GT CT T T TT TTGT GT TTTC AGAGTAGT TC AC AACAGATCTG AGTGTTT T AATT AAGC AT GG AAT 
AC AGAAAACAAC AAAAAACT TAAGCT TT AATT TCATCT GGAAT TCCAC AGTT TTCT T AGCTC CC TG GACC C 
GGTTGACCTGTTGGCTCTTCCCGCTGGCTGCTCTATCACGTGGTGCTCTCCGACTACTCACCCCGAGTGTA 
AAGAACCTTCGGCTCGCGTGCTTCTGAGCTGCTGTGG ATGG CCTCGGCTCTCTGGACTGTCCTTCCGAGTA 
GGATGT CAC TGAGATC CCT C AAAT GGAGC CT C CTGCT GCT GTCACT CCT GAG TTT C TT TGTGAT GT GGT AC 
CT CAGCCTT CCCCACT AC AATG TGAT AGAACGCGTGAACT GGATGT ACT T CT ATGAGT ATGAGCCG ATTT A 
CAGACAAGACTTTCACTTCACACTTCGAGAGCATTCAAACTGCTCTCATCAAAATCCATTTCTGGTCATTC
TGGTGACCTCCCACCCTTCAGATGTGAAAGCCAGGCAGGCCATTAGAGTTACTTGGGGTGAAAAAAAGTCT
TGGTGGGGATATGAGGTTCTTACATTTTTCTTATTAGGCCAAGAGGCTGAAAAGGAAGACAAAATGTTGGC
ATTGTCCTTAGAGGATGAACACCTTCTTTATGGTGACATAATCCGACAAGATTTTTTAGACACATATAATA
ACCTGACCTTGAAAACCATTATGGCATTCAGGTGGGTAACTGAGTTTTGCCCCAATGCCAAGTACGTAATG
AAGACAGACACTGATGTTTTCATCAATACTGGCAATTTAGTGAAGTATCTTTTAAACCTAAACCACTCAGA
GAAGTTTTTCACAGGTTATCCTCTAATTGATAATTATTCCTATAGAGGATTTTACCAAAAAACCCATATTT
CTTACCAGGAGTATCCTTTCAAGGTGTTCCCTCCATACTGCAGTGGGTTGGGTTATATAATGTCCAGAGAT
TTGGTGCCAAGGATCTATGAAATGATGGGTCACGTAAAACCCATCAAGTTTGAAGATGTTTATGTCGGGAT
CTGTTTGAATTTATTAAAAGTGAACATTCATATTCCAGAAGACACAAATCTTTTCTTTCTATATAGAATCC
ATTTGGATGTCTGTCAACTGAGACGTGTGATTGCAGCCCATGGCTTTTCTTCCAAGGAGATCATCACTTTT
TGGCAGGTCATGCTAAGGAACACCACATGCCATTATTAACTTCACATTCTACAAAAAGCCTAGAAGGACAG
gataccttgtggaaagtgttaaataaagtaggtactgtggaaaattcatggggaggtcagtgtgctggctt
acactgaactgaaactcatgaaaaacccagactggagactggagggttacacttgtgatttattagtcagg 
cccttcaaagatgatatgtggaggaattaaatataaaggaattggaggtttttgctaaagaaattaatagg 
accaaacaatttggacatgtcattctgtagactagaatttcttaaaagggtgttactgagttataagctca 
ctaggctgtaaaaacaaaacaatgtagagttttatttattgaacaatgtagtcacttgaaggttttgtgta 
tatcttatgtggattaccaatttaaaaatatatgtagttctgtgtcaaaaaacttcttcactgaagttata 
ctgaacaaaattttacctgtttttggtcatttataaagtacttcaagatgttgcagtatttcacagttatt 
attatttaaaattacttcaactttgtgtttttaaatgttttgacgatttcaatacaagataaaaaggatag 
tgaatcattctttacatgcaaacattttccagttacttaactgatcagtttattattgatacatcactcca 
ttaatgtaaagtcataggtcattattgcatatcagtaatctcttggactttgttaaatattttactgtggt 
aatatagagaagaattaaagcaagaaaatctgaaaa 
FIGURE 36

MASALWTVLPSRMSLRSLKWSLLLLSLLSFFVMWYLSLPHYNVIERVNWMYFYEYEPIYRQDFHF 
TLREHSNCSHQNPFLVILVTSHPSDVKARQAIRVTWGEKKSWWGYEVLTFFLLGQEAEKEDKMLA
LSLEDEHLLYGDIIRQDFLDTYNNLTLKTIMAFRWVTEFCPNAKYVMKTDTDVFINTGNLVKYLL 
NLNHSEKFFTGYPLIDNYSYRGFYQKTHISYQEYPFKVFPPYCSGLGYIMSRDLVPRIYEMMGHV 
KPIKFEDVYVGICLNLLKVNIHIPEDTNLFFLYRIHLDVCQLRRVIAAHGFSSKEIITFWQVMLR
NTTCHY 

Important features: 

Type II transmembrane domain: 

amino acids 20-39 

N-glycosylation sites. 

amino acids 72-76, 154-158, 198-202, 212-216, 326-330 

Glycosaminoglycan attachment site. 

amino acids 239-243 

Ly-6 / u-PAR domain proteins. 

amino acids 23-37 

N-myristoylation site. 

amino acids 271-277 
FIGURE 37



CGCTCGGGCACCAGCCGCGGCAAGGATGGAGCTGGGTTGCTGGACGCAGTTGGGGCTCACTTTTCTTCAGCTCCTTCTCATC 
TCGTCC T TGCC AAGAGAGTAC ACAGTC AT TAATGAAGCC TGCC C TGGAGCAGAGTGGAATATCAT GTGTCGGGAGTGC TGTG 
AATAT GAT C AGATTGAGTGCGTC TGCCC C GGAAAGAGGGAAGT CGTGGGTTATACC ATCC CT TGC TGC AGGAATGAGGAGAA 
TGAGTGTGACTCCTGCCTGATCCACCCAGGTTGTACCATCTTTGAAAACTGCAAGAGCTGCCGAAATGGCTCATGGGGGGGT 
ACC T TGGATGAC TTC TATGTGAAGG GGT TCTACT GT GCAGAGTGCCGAGCAGGCTGGTAC GGAGGAGACTGCATGC GATGTG 
GCC AGGT TC TGC GAGCCCCAAAGGGTC AGATT TT GT TGGAAAGC TATCC CC T AAATGCTC AC T GT GAATGGACCAT T CATGC 
T AAACC TGGGT T TGTCATCC AAC TAAGAT TTGTCAT GTT GAGT C TGGAGTT TGAC TACAT GT GC CAGTATGAC TAT GT TGAG 
GTTCGTGATGGAGACAACCGCGATGGCCAGATCATCAAGCGTGTCTGTGGCAACGAGCGGCCAGCTCCTATCCAGAGCATAG
GATCCTCACTCCACGTCCTCTTCCACTCCGATGGCTCCAAGAATTTTGACGGTTTCCATGCCATTTATGAGGAGATCACAGC
ATGCTCCTCATCCCCTTGTTTCCATGACGGCACGTGCGTCCTTGACAAGGCTGGATCTTACAAGTGTGCCTGCTTGGCAGGC 
TATAC TGGGCAGCGC TGTGAAAATC TC C T TGAAGAAAGAAAC T GCTCAGACCCTGGGGGC CCAGT CAATGGGTACCAGAAAA 
TAAC AGGGGGC C CTGGGCTTATCAACGGACGCCAT GC TAAAAT T GGCACCGTGGTGTCT T TC T T T TGTAAC AACTC C TATGT 
TCTTAGTGGCAATGAGAAAAGAACTTGCCAGCAGAATGGAGAGTGGTCAGGGAAACAGCCCATCTGCATAAAAGCCTGCCGA 
GAAC CAAAGAT TTCAGACC TGGT GAGAAGGAGAGT T C TT CC GAT GCAGGTTC AGTC AAGGGAGACACCATTACACC AGC TAT 
ACTCAGCGGCCTTCAGCAAGCAGAAACTGCAGAGTGCCCCTACCAAGAAGCCAGCCCTTCCCTTTGGAGATCTGCCCATGGG 
ATACCAACATCTGCATACCCAGCTCCAGTATGAGTGCATCTCACCCTTCTAC^ 

C TGAGGAC T GGGAAGTGGAGTGGGCGGGC ACCAT C C TGCAT CC C TATC TGCGGGAAAAT TGAGAACATCAC TGCTC C AAAGA 
CCCAAGGGT TGCGC TGGCCGTGGCAGGC AGCCAT C TACAGGAGGACCAGCGGGGTGCATGACGGCAGCC TACACAAGGGAGC 
GTGGTTCCTAGTCTGCAGCGGTGCCCTGGTGAATGAGCGCACTGTGGTGGTGGCTGCCCACTGTGTTACTGACCTGGGGAAG 



GTCAC CAT GATCAAGACAGC AGACC TGAAAGT TGT T T T GGGGAAAT TC TACCGGGATGAT GAC C GGGATGAGAAGAC CAT CC 
AGAGCCTACAGATTTCTGCTATCATTCTGCATCCCAACTATGACCCCATCCTGCTTGATGCTGACATCGCCATCCTGAAGCT
CCTAGACAAGGCCCGTATCAGCACCCGAGTCCAGCCCATCTGCCTCGCTGCCAGTCGGGATCTCAGCACTTCCTTCCAGGAG
TCCCACATCACTGTGGCTGGCTGGAATGTCCTGGCAGACGTGAGGAGCCCTGGCTTCAAGAACGACACACTGCGCTCTGGGG
TGGTCAGTGTGGTGGACTCGC TGC TGTGTGAGGAGCAGCAT GAGGACCATGGCATCCCAGTGAGT GTCAC TGATAACATGT T 
C TGTGCC AGCTGGGAACCCAC TGCC CC T T C TGATAT C TGCAC TGCAGAGACAGGAGGCAT CGCGGCTGTGTCCTTCCC GGGA 
CGAGCATC T CC TGAGCCAC GC TG GC ATC TGATGGGAC TGGT CAGCTGGAGC TATGATAAAACATGCAGC CACAGGC TCT CC A 
C TGC C T T CACCAAGGTGCT GCCT T T TAAAGAC TGGAT TGAAAGAAATAT GAAATGRACCATGC TC ATGC AC TCC TTGAGAAG 
TGTTTCTGTATATCCGTCTGTACGTGTGTCATTGCGTGAAGCAGTGTGGGCCTGAAGTGTGATTTGGCCTGTGAACTTGGCT 
GTGCCAGGGCTTCTGACTTCAGGGACAAAACTCAGTGAAGGGTGAGTAGACCTCCATTGCTGGTAGGCTGATGCCGCGTCCA 
C TAC TAGGACAGC CAATTGGAAGAT GCCAGGGC T TGCAAGAAGTAAGT T TC TTCAAAGAAGAC C ATATACAAAACC TCT C C A 
CTCCACTGACCTGGTGGTCTTCCCCAACTTTCAGTTATACGAATGCCATCAGCTTGACCAGGGAAGATCTGGGCTTCATGAG 
GCCCCTTTT GAGGC TC TCAAGT TCTAGAGAGC TGCC TGTGGGACAGCC CAGGGCAGC AGAGCT GGGATGTGGTGCATGC CT T 
TGTGTACATGGCCACAGTACAGTCTGGTCCTTTTCCTTCCCCATCTCTTGTACACATTTTAATAAAATAAGGGTTGGCTTCT 
GAACTACAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^ 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
FIGURE 38

MELGCWTQLGLTFLQLLLISSLPREYTVINEACPGAEWNIMCRECCEYDQIECVCPGKREVVGYT
IPCCRNEENECDSCLIHPGCTIFENCKSCRNGSWGGTLDDFYVKGFYCAECRAGWYGGDCMRCGQ
VLRAPKGQILLESYPLNAHCEWTIHAKPGFVIQLRFVMLSLEFDYMCQYDYVEVRDGDNRDGQII
KRVCGNERPAPIQSIGSSLHVLFHSDGSKNFDGFHAIYEEITACSSSPCFHDGTCVLDKAGSYKC
ACLAGYTGQRCENLLEERNCSDPGGPVNGYQKITGGPGLINGRHAKIGTVVSFFCNNSYVLSGNE
KRTCQQNGEWSGKQPICIKACREPKISDLVRRRVLPMQVQSRETPLHQLYSAAFSKQKLQSAPTK
KPALPFGDLPMGYQHLHTQLQYECISPFYRRLGSSRRTCLRTGKWSGRAPSCIPICGKIENITAP
KTQGLRWPWQAAIYRRTSGVHDGSLHKGAWFLVCSGALVNERTVVVAAHCVTDLGKVTMIKTADL
KVVLGKFYRDDDRDEKTIQSLQISAIILHPNYDPILLDADIAILKLLDKARISTRVQPICLAASR
DLSTSFQESHITVAGWNVLADVRSPGFKNDTLRSGVVSVVDSLLCEEQHEDHGIPVSVTDNMFCA
SWEPTAPSDICTAETGGIAAVSFPGRASPEPRWHLMGLVSWSYDKTCSHRLSTAFTKVLPFKDWI
ERNMK

Important features of the protein:

Signal peptide:

amino acids 1-23

EGF-like domain cysteine pattern signature.

amino acids 260-272

N-glycosylation sites. 

amino acids 96-100, 279-283, 316-320, 451-455, 614-618

N-myristoylation sites.

amino acids 35-41, 97-103, 256-262, 284-290, 298-304, 308-314,
474-480, 491-497, 638-644, 666-672

Amidation site. 

amino acids 56-60 

Serine proteases, trypsin family. 

amino acids 489-506 

CUB domain proteins profile. 

amino acids 150-167 
FIGURE 39



GGTT C CTACAT CCT CT CAT CT GAGAAT CAGAGAGCATAAT CTTCT TACGGGC CC GTGATTTATTAAC GT GGCTTAATC 
TGAAGGTTGTCAGTCAAATTCTTTGTGATCTACTGATTGTGGGGGCATGGCAAGGTTTGCTTAAAGGAGCTTGGCTGG 
TTTGGGCCCTTGTAGCTGACAGAAGGTGGCCAGGGAGAATGCAGCACACTGCTCGGAGAATGAAGGCGCTTCTGTTGC
TGGTCTTGCCTTGGCTCAGTCCTGCTAACTACATTGACAATGTGGGCAACCTGCACTTCCTGTATTCAGAACTCTGTA 
AAGGT GC CT CC CACT ACGGCCT GACCAAAGATAGGAAGAG GC GCT CACAAGAT GGCT GT CCAGACGGCT GT GC GAGCC 
TCACAGCCACGGCTCCCTCCCCAGAGGTTTCTGCAGCTGCCACCATCTCCTTAATGACAGACGAGCCTGGCCTAGACA 
AC CCT GCCTACGT GT C CT C GGCAGAG GACGGGCAGC CAGCAAT CAGCCCAGT GGACT CT GGCCG GAGCAACCGAACT A 
GGGCACGGCCCTTT GAGAGAT C CACTATTAGAAGCAGATCATT TAAAAAAATAAAT CGAGCT TT GAGTGTT CTT CGAA 
GGACAAAGAGCG GGAGT GCAGTTGCCAAC CAT GCC GACCAGGG CAGG GAAAAT T CTGAAAACAC CACT GCCC CT GAAG 

TCTTTCCAAGGTTGTACCACCTGATTCCAGATGGTGAAATTACCAGCATCAAGATCAATCGAGTAGATCCCAGTGAAA

GC CT CTCTAT TAGGCT G GT GGGAGGTAGC GAAACCCCACT GGTCCATAT CAT TAT CCAACACATTTATCGTGAT GGGG 
TGAT CGC CAGAGACGGC CGGCTACTGCCAGGAGACAT CAT T CTAAAGGT CAACG GGAT GGACAT CAGCAATGT CCCT C 

ACAACTACGCTGTGCGTCTCCTGCGGCAGCCCTGCCAGGTGCTGTGGCTGACTGTGATGCGTGAACAGAAGTTCCGCA
GCAGGAACAATGGACAGGCCCCGGATGCCTACAGACCCCGAGATGACAGCTTTCATGTGATTCTCAACAAAAGTAGCC

CCGAGGAGCAGCTTGGAATAAAACTGGTGCGCAAGGTGGATGAGCCTGGGGTTTTCATCTTCAATGTGCTGGATGGCG 
GT GTGGCAT ATCGACAT GGT CAGCTT GAGGAGAAT GACC GTGT GTT AGC CAT CAATGGACAT GAT CT TCGATAT G GCA 

GCCCAGAAAGTGCGGCTCATCTGATTCAGGCCAGTGAAAGACGTGTTCACCTCGTCGTGTCCCGCCAGGTTCGGCAGC
GGAGCCCTGACATCTTTCAGGAAGCCGGCTGGAACAGCAATGGCAGCTGGTCCCCAGGGCCAGGGGAGAGGAGCAACA
CTCCCAAGCCCCTCCATCCTACAATTACTTGTCATGAGAAGGTGGTAAATATCCAAAAAGACCCCGGTGAATCTCTCG

GCAT GACCGT CGCAGGGGGAGCAT CACATAGAGAAT GG GATTT GC CTAT CTAT GT CAT CAGTGTT GAGC C CGGAGGAG 
T GATAAGCAGAGAT GGAAGAATAAAAACAGGT GACATTTT GTTGAAT GT G GAT GG GGT C GAACT GACAGAGGT CAGCC 
GGAGTGAGGCAGTGGCATTATTGAAAAGAACATCATCCTCGATAGTACTCAAAGCTTTGGAAGTCAAAGAGTATGAGC
CCCAGGAAGACTGCAGCAGCCCAGCAGCCCTGGACTCCAACCACAACATGGCCCCACCCAGTGACTGGTCCCCATCCT

GGGT CAT GT G GCT GGAATTAC CACGGT G CTT GTATAACT GT AAAGATATT GTATTACGAAGAAACACAGCT GGAAGT C 
T GGGCTT CT GCATT GTAGGAGGTT AT GAAGAATACAAT GGAAACAAACCTTTT TT CAT CAAAT CCAT T GT TGAAGGAA 
CACCAGCATACAAT GAT GGAAGAATTAGAT GT GGT GATATTCT T CTT GCT GT CAATGGTAGAAGTACAT CAGGAATGA 
TACAT GCTT G CT T GG CAAGACTGCT GAAAGAACT TAAAGGAAGAATTACT CTAAC TATT GTTTCT TGGC CTGGCACTT 
TTTT ATAGA ATCAAT GATGG GTCAGAGGAAAACAGAAAAA.TCACAAATAG GCTAAGAAGT TGAAAC ACTATATT TATC 
TTGT CAGTTTTT ATATT TAAAGAAAGAATACATT GTAAAAAT GT CAGGAAAAGTATGAT CAT CTAAT GAAAGC CAGT T 
ACAC CTCAGAAAATATGATT C CAAAAAAATTAAAACTACTAGT TTTT T TT CAGT GTGGAGGATTT CT CAT TACT CTAC 
AACAT TGT TTATATT TT TT CTATTCAATAAAAAGC CCTAAAACAACTAAAAT GAT TGAT TT GTATACCC CACT GAATT 
CAAGCTGATTTAAAT TTAAAAT TT GGTATAT GCT GAAGT CTGCCAAGGGTAC ATTAT GGCCATTT TTAAT TTACAGCT 
AAAATATTTTTTAAAATGCATTGCTGAGAAACGTTGCTTTCATCAAACAAGAATAAATATTTTTCAGAAGTTAAA 
FIGURE 40



MKALLLLVLPWLSPANYIDNVGNLHFLYSELCKGASHYGLTKDRKRRSQDGCPDGCASLTATAPS 
PEVSAAATISLMTDEPGLDNPAYVSSAEDGQPAISPVDSGRSNRTRARPFERSTIRSRSFKKINR
ALSVLRRTKSGSAVANHADQGRENSENTTAPEVFPRLYHLIPDGEITSIKINRVDPSESLSIRLV
GGSETPLVHIIIQHIYRDGVIARDGRLLPGDIILKVNGMDISNVPHNYAVRLLRQPCQVLWLTVM
REQKFRSRNNGQAPDAYRPRDDSFHVILNKSSPEEQLGIKLVRKVDEPGVFIFNVLDGGVAYRHG
QLEENDRVLAINGHDLRYGSPESAAHLIQASERRVHLVVSRQVRQRSPDIFQEAGWNSNGSWSPG
PGERSNTPKPLHPTITCHEKVVNIQKDPGESLGMTVAGGASHREWDLPIYVISVEPGGVISRDGR
IKTGDILLNVDGVELTEVSRSEAVALLKRTSSSIVLKALEVKEYEPQEDCSSPAALDSNHNMAPP
SDWSPSWVMWLELPRCLYNCKDIVLRRNTAGSLGFCIVGGYEEYNGNKPFFIKSIVEGTPAYNDG
RIRCGDILLAVNGRSTSGMIHACLARLLKELKGRITLTIVSWPGTFL

Important features: 
Signal peptide: 

amino acids 1-15 

N-glycosylation sites.

amino acids 108-112, 157-161, 289-293, 384-388 

Tyrosine kinase phosphorylation sites.

amino acids 433-441, 492-500 

N-myristoylation sites. 

amino acids 51-57, 141-147, 233-239, 344-350, 423-429, 447-453, 
467-473, 603-609 
FIGURE 41



ACCAGGCATTGT ATCT T CAGT TGT CATCAAGT TCGCAAT CAGATT GGAAAAGCTCAACTTGAAGCTT T 
CTT GC CT GCAGTGAAGCAGAGAGATAGATATTATT CACGTAATAAAAAACATGGGCTTCAACCTGACT 
TTCCACCTTTCCTACAAATTCCGATTACTGTTGCTGTTGACTTTGTGCCTGACAGTGGTTGGGTGGGC 
CACCAGTAACTACTTCGTGGGTGCCATTCAAGAGATTCCTAAAGCAAAGGAGTTCATGGCTAATTTCC 
ATAAGACCCTCAT TTT GGGGAAGGGAAAAACTCTGACTAAT GAAGCAT CCACGAAGAAGGTAGAACT T 
GACAACTGTCCT T CTGTGTCTCCT TACC TCAGAGGCCAGAGCAAGCT CATTTTCAAACCAGAT CTCAC 
TTT GGAAGAGGTACAGGCAGAAAAT CCCAAAGTGT C CAGAGGCCGGTATCGCCCTCAGGAATGTAAAG 



CTT TACAGAGGGT CGC CATCC TCGT TCC CCACCGGAACAGAGAGAAACACCTGATGTACCTGCT GGAA 






CAT CT GCATCCCTTCCTGCAGAGGCAGCAGCTGGATTATGGCATCTACGTCATCCACCAGGCTGAAGG 
TAAAAAGTTTAATCGAGCCAAACTCTTGAATGTGGGCTATCTAGAAGCCCTCAAGGAAGAAAATTGGG 
ACT GCTTTATAT T CCACGATGTGGACCT GGTACCCGAGAATGACTTTAACCT T TACAAGTGTGAGGAG 
CAT CCCAAGCATCTGGT GGTTGGCAGGAACAGCACTGGGTACAGGTTACGT TACAGTGGATAT TT TGG 
GGGTGTTACTGCCCTAAGCAGAGAGCAGTTTTTCAAGGTGAATGGATTCTCTAACAACTACTGGGGAT 
GGGGAGGCGAAGACGATGACCTCAGACTCAGGGTTGAGCTCCAAAGAATGAAAATTTCCCGGCCCCTG 
CCTGAAGTGGGTAAATATACAATGGTCTTCCACACTAGAGACAAAGGCAATGAGGTGAACGCAGAACG 
GAT GAAGCTCT TACAC CAAGT GTCACGAGTCT GGAGAACAGAT GGGTT GAGTAGT TGTTCTTATAAAT 



TAGTATCTGTGGAACACAATCCTTTATATATCAACATCACAGTGGATTTCTGGTTTGGTGCATGACCC
TGGATCTTTTGGTGATGTTTGGAAGAACTGATTCTTTGTTTGCAATAATTTTGGCCTAGAGACTTCAA



ATAGTAGCAC ACAT T AAGAAC C T G T TACAGC T CAT T G T T GAGC T GAAT TTTTCCTTTTTG TAT T T T C T 
TAGCAGAGCTCCTGGTGATGTAGAGTATAAAACAGTTGTAACAAGACAGCTTTCTTAGTCATTTTGAT 
CAT GAGGGT TAAATATTGTAATAT GGAT ACT T GAAGGACT TTATATAAAAGGATGACTCAAAGGATAA 
AAT GAACGCTAT T TGAGGACTCTGGTT GAAGGAGATT TATT TAAATT T GAAGTAAT AT AT TAT GGGAT 
AAAAGGCCACAGGAAATAAGACTGCTGAATGTCTGAGAGAACCAGAGTTGTTCTCGTCCAAGGTAGAA 
AGGTACGAAGATACAATACTGTTATTCATTTATCCTGTACAATCATCTGTGAAGTGGTGGTGTCAGGT 
GAGAAGGCGTCCACAAAAGAGGGGAGAAAAGGCGACGAATCAGGACACAGTGAACTTGGGAATGAAGA 
GGTAGCAGGAGGGTGGAGTGTCGGCTGCAAAGGCAGCAGTAGCTGAGCTGGTTGCAGGTGCTGATAGC 
CTTCAGGGGAGGACCTGCCCAGGTATGCCTTCCAGTGATGCCCACCAGAGAATACATTCTCTATTAGT 
TTTTAAAGAGT TT TTGTAAAATGATTTT GTACAAGTAGGAT AT GAATT AGCAGTTTACAAGTT TACAT 
ATTAACTAATAATAAATATGTCTATCAAATACCTCTGTAGTAAAATGTGAAAAAGCAAAA 
FIGURE 42

MGFNLTFHLSYKFRLLLLLTLCLTVVGWATSNYFVGAIQEIPKAKEFMANFHKTLILGKGKTLTN
EASTKKVELDNCPSVSPYLRGQSKLIFKPDLTLEEVQAENPKVSRGRYRPQECKALQRVAILVPH
RNREKHLMYLLEHLHPFLQRQQLDYGIYVIHQAEGKKFNRAKLLNVGYLEALKEENWDCFIFHDV
DLVPENDFNLYKCEEHPKHLVVGRNSTGYRLRYSGYFGGVTALSREQFFKVNGFSNNYWGWGGED
DDLRLRVELQRMKISRPLPEVGKYTMVFHTRDKGNEVNAERMKLLHQVSRVWRTDGLSSCSYKLV
SVEHNPLYINITVDFWFGA 



Important features:
Signal peptide:

amino acids 1-27

N-glycosylation sites.

amino acids 4-8, 220-224, 335-339

Xylose isomerase proteins.

amino acids 191-202
FIGURE 43



GC T CAAGACCCAGCAGTGGGACAGCCAGACAGACGGCACGATGGCAC TGAGC T CCCAGATC TGGG 
CCGCTTGCCTCCTGCTCCTCCTCCTCCTCGCCAGCCTGACCAGTGGCTCTGTTTTCCCACAACAG 
ACGGGACAAC T T GCAGAGC TGCAACCC CAGGACAGAGC TGGAGCCAGGGCCAGCTGGATGCCCAT 
GTTCCAGAGGCGAAGGAGGCGAGACACCCACTTCCCCATCTGCATTTTCTGCTGCGGCTGCTGTC 
ATCGATCAAAGTGTGGGATGTGCTGCAAGACGTAGAACCTACCTGCCCTGCCCCCGTCCCCTCCC 
TTCCTTATTTATTCCTGCTGCCCCAGAACATAGGTCTTGGAATAAAATGGCTGGTTCTTTTGTTT 
TCCAAAAAAAAAAAAAAAAAAAA?U^AAAAAAAAAAAAAAAAAA^ 



%J AAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
FIGURE 44

MALSSQIWAACLLLLLLLASLTSGSVFPQQTGQLAELQPQDRAGARASWMPMFQRRRRRDTHFPI
CIFCCGCCHRSKCGMCCKT

Important features: 
Signal peptide: 

amino acids 1-24 

cAMP- and cGMP-dependent protein kinase phosphorylation site.

amino acids 58-62

N-myristoylation site. 

amino acids 44-50 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 1-12 
FIGURE 45



GTGGCTTCATTTCAGTGGCTGACTTCCAGAGAGCAATATCGCTGGTTCCCCAACATGCCTCACCC 
TCATCTATATCCTTTGGCAGCTCACAGGGTCAGCAGCCTCTGGACCCGTGAAAGAGCTGGTCGGT 
TCCGTTGGTGGGGCCGTGACTTTCCCCCTGAAGTCCAAAGTAAAGCAAGTTGACTCTATTGTCTG 
GACCTTCAACACAACCCCTCT.TGTCACCATACAGCCAGAAGGGGGCACTATCATAGTGACCCAAA 
AT CGTAATAGGGAGAGAGTAGACT T CCCAGAT GGAGGC TAC T CCCT GAAGCT CAGCAAAC TGAAG 
AAGAATGACTCAGGGATCTACTATGTGGGGATATACAGCTCATCACTCCAGCAGCCCTCCACCCA 
GGAGTACGTGCTGCATGTCTACGAGCACCTGTCAAAGCCTAAAGTCACCATGGGTCTGCAGAGCA 
AT AAGAATGGC AC C TG T G T GAC C AAT C T GAC ATG C T GC AT GGAACAT GGGGAAGAG GAT G T GAT T 
TATACCTGGAAGGCCCTGGGGCAAGCAGCCAATGAGTCCCATAATGGGTCCATCCTCCCCATCTC 
CTGGAGATGGGGAGAAAGTGATATGACCTTCATCTGCGTTGCCAGGAACCCTGTCAGCAGAAACT 
TCTCAAGCCCCATCCTTGCCAGGAAGCTCTGTGAAGGTGCTGCTGATGACCCAGATTCCTCCATG 
GTCCTCCTGTGTCTCCTGTTGGTGCCCCTCCTGCTCAGTCTCTTTGTACTGGGGGTATTTCTTTG 
GT T T CTGAAGAGAGAGAGACAAGAAGAGTACATT GAAGAGAAGAAGAGAGTGGACAT TT GT CGGG 
AAAC T CC TAACATATGCCCCCAT TC TGGAGAGAACACAGAGTACGACACAATCCCT CAC AC TAAT 
agaacaatcctaaaggaagatccagcaaatacggtttactccactgtggaaataccgaaaaagat 
ggaaaatccccactc^ctgctcacgatgccagacacaccaaggctatttgcctatgagaatgtta 
tctagacagcagtgcactcccctaagtctctgctca 
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FIGURE 46 

MAGSPTCLTLIYILWQLTGSAASGPVKELYGSVGGAVTFPLKSECVKQVDSIVWTFNTTPLVTIQP 
EGGTIIVTQNRNRERVDFPDGGYSLKLSKLKKNDSGIYYVGIYSSSLQQPSTQEYVLHVYEHLSK 
PKVTMGLQSNKNGTCVTNLTCCMEHGEEDVIYTWKALGQAANESHNGSILPISWRWGESDMTFIC 
VARNPVSRNFS S P I LARKLCEGAADDPDS SMVLLCLLLVPLLLS LFVLGLFLWFLKRERQEE YI E 
EKKRVDICRETPNICPHSGENTEYDTIPHTNRTILKEDPANTVYSTVEIPKKMENPHSLLTMPDT 
PRLFAYENVI 

Important features: 
Signal peptide: 

amino acids 1-22 

Transmembrane domain: 

amino acids 224-250 

Leucine zipper pattern. 

amino acids 229-251 

N-glycosylation sites. 

amino acids 98-102, 142-146, 148-152, 172-176, 176-180, 204-208, 
291-295 
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FIGURE 47 

GGCTCGAGCGTTTCTGAGCCAGGGGTGACCATGACCTGCTGCGAAGGATGGACATCCTGCAATGG 
ATTCAGCCTGCTGGTTCTACTGCTGTTAGGAGTAGTTCTCAATGCGATACCTCTAATTGTCAGCT 
TAGTTGAGGAAGACCAATTTTCTCAAAACCCCATCTCTTGCTTTGAGTGGTGGTTCCCAGGAATT 
ATAGGAGCAGGTCTGATGGCCATT CCAGCAACAACAATGTCCT TGACAGCAAGAAAAAGAGCGTG 
CTGCAACAACAGAACTGGAATGTTTCTTTCATCATTTTTCAGTGTGATCACAGTCATTGGTGCTC 
TGTATTGCATGCTGATATCCATCCAGGCTCTCTTAAAAGGTCCTCTCATGTGTAATTCTCCAAGC 
AACAGTAATGCCAAT TGT GAAT TT TCAT TGAAAAACAT CAGTGACAT TCATCCAGAATCC T TCAA 
CTTGCAGTGGTTTTTCAATGACTCTTGTGCACCTCCTACTGGTTTCAATAAACCCACCAGTAACG 
ACACCATGGCGAGT GGC TGGAGAGCAT C TAGT T T CCAC T TCGAT T CT GAAGAAAACAAACATAGG 
CTTATCCACTTCTCAGTATTTTTAGGTCTATTGCTTGTTGGAATTCTGGAGGTCCTGTTTGGGCT 
CAGTCAGATAGTCATCGGTTTCCTTGGCTGTCTGTGTGGAGTCTCTAAGCGAAGAAGTCAAATTG 
T G TAG T T TAATGGGAATAAAAT GTAAGTATGAGTAG TT TGAAAAAAAAAAA 
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FIGURE 48 



MTCCEGWTSCNGFSLLVLLLLGWLNAIPLIVSLVEEDQFSQNPISCFEWWFPGIIGAGLMAIPA 
T TMSLTARKRACCNNRTGMFLSS FFSVI TVTGALYCML I S I QALLKGPLMCNS PSNSNANCE FSL 
KNISDIHPESFNLQWFFNDSCAPPTGFNKPTSNDTMASGWRASSFHFDSEENKHRLIHFSVFLGL 
LLVGILEVLFGLSQIVIGFLGCLCGVSKRRSQIV 

Important features: 
Transmembrane domains: 

amino acids 10-31 (type II), 50-72, 87-110, 191-213 
N-glycosylation sites. 

amino acids 80-84, 132-136, 148-152, 163-167 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 223-227 

N-myristoylation sites. 

amino acids 22-28, 54-60, 83-89, 97-103, 216-222 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 207-218 

TNFR/NGFR family cysteine-rich region protein. 

amino acids 4-12 
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FIGURE 49 



ATCCGTTCTCTGCGCTGCCAGCTCAGGTGAGCCCTCGCCAAGGTGACCTCGCAGGACACTGGTGA 
AGGAGCAGTGAGGAACCTGCAGAGTCACACAGTTGCTGACCAATTGAGCTGTGAGCCTGGAGCAG 
ATCCGTGGGCTGCAGACCCCCGCCCCAGTGCCTCTCCCCCTGCAGCCCTGCCCCTCGAACTGTGA 
CATGGAGAGAGTGACCCTGGCCCTTCTCCTACTGGCAGGCCTGACTGCCTTGGAAGCCAATGACC 
CATTTGCCAATAAAGACGATCCCTTCTACTATGACTGGAAAAACCTGCAGCTGAGCGGACTGATC 
TGCGGAGGGCTCCTGGCCATTGCTGGGATCGCGGCAGTTCTGAGTGGCAAATGCAAATACAAGAG 
CAGCCAGAAGCAGCACAGTCCTGTACCTGAGAAGGCCATCCCACTCATCACTCCAGGCTCTGCCA 
C T AC T T GC TGAGC ACAGGAC T G G C C T C CAGGGAT G G CC T GAAG C C TAAC AC T G G C C C C CAGC ACC 
TCCTCCCCTGGGAGGCCTTATCCTCAAGGAAGGACTTCTCTCCAAGGGCAGGCTGTTAGGCCCCT 
TTCTGATCAGGAGGCTTCTTTATGAATTAAACTCGCCCCACCACCCCCTCA 
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FIGURE 50 

MERVTLALLLLAGLTALEANDPFANKDDPFYYDWKNLQLSGLICGGLLAIAGIAAVLSGKCKYKS 
SQKQHSPVPEKAIPLITPGSATTC 

Important features: 
Signal peptide: 

amino acids 1-16 

Transmembrane domain: 

amino acids 36-59 

N-myristoylation sites. 

amino acids 41-47, 45-51, 84-90 

Extracellular proteins SCP/Tpx-1/Ag5/PR-1/Sc7. 

amino acids 54-67 
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FIGURE 51 



GT GGACTCTGAGAAGCCCAGGCAGTT GAGGACAGGAGAGAGAAGGCTGCAGACCCAGAGGGAGGG 
AGGACAGGGAGT C GGAAGGAGGAGGACAGAGGAGGGCACAGAGAC GCAGAGCAAGGGCGGCAAGG 
AGGAGACCC TGGT GGGAGGAAGACACTCT GGAGAGAGAGGGGGCT GGGCAGAGATGAAGT TCCAG 
GGGCCCCTGGCCTGCCTCCTGCTGGCCCTCTGCCTGGGCAGTGGGGAGGCTGGCCCCCTGCAGAG 
CGGAGAGGAAAGCACTGGGACAAATATTGGGGAGGCCCTTGGACATGGCCTGGGAGACGCCCTGA 
GCGAAGGGGTGGGAAAGGCCATTGGCAAAGAGGCCGGAGGGGCAGCTGGCTCTAAAGTCAGTGAG 
GCCCTTGGCCAAGGGACCAGAGAAGCAGTTGGCACTGGAGTCAGGCAGGTTCCAGGCTTTGGCGC 
AGCAGATGCTTTGGGCAACAGGGTCGGGGAAGCAGCCCATGCTCTGGGAAACACTGGGCACGAGA 
TTGGCAGACAGGCAGAAGATGTCATTCGACACGGAGCAGATGCTGTCCGCGGCTCCTGGCAGGGG 
GTGCCTGGCCACAGTGGTGCTTGGGAAACTTCTGGAGGCCATGGCATCTTTGGCTCTCAAGGTGG 
CCTTGGAGGCCAGGGCCAGGGCAATCCTGGAGGTCTGGGGACTCCGTGGGTCCACGGATACCCCG 
GAAACTCAGCAGGCAGC T TT GGAAT GAAT CC T CAGGGAGC TCC CT GGGGT CAAGGAGGCAATGGA 
GGGCCACCAAACTTTGGGACCAACACTCAGGGAGCTGTGGCCCAGCCTGGCTATGGTTCAGTGAG 
AGC CAGCAACCAGAATGAAGGGTGCACGAAT CCCCCACCATC T GGC T CAGGTGGAGGC TCCAG CA 
ACTCTGGGGGAGGCAGCGGCTCACAGTCGGGCAGCAGTGGCAGTGGCAGCAATGGTGACAACAAC 
AAT GGCAGCAGCAGT GGTGGCAGCAGCAGTGGCAGCAGCAGTGGCAGCAGCAGT GGCGGCAGCAG 
TGGCGGCAGCAGTGGTGGCAGCAGTGGCAACAGTGGTGGCAGCAGAGGTGACAGCGGCAGTGAGT 
CCTCCTGGGGATCCAGCACCGGCTCCTCCTCCGGCAACCACGGTGGGAGCGGCGGAGGAAATGGA 
CATAAACCCGGGTGTGAAAAGCCAGGGAATGAAGCCCGCGGGAGCGGGGAATCTGGGATTCAGGG 
CTTCAGAGGACAGGGAGTTTCCAGCAACATGAGGGAAATAAGCAAAGAGGGCAATCGCCTCCTTG 
GAGGCTCTGGAGACAATTATCGGGGGCAAGGGTCGAGCTGGGGCAGTGGAGGAGGTGACGCTGTT 
GGTGGAGTCAATACTGTGAACTCTGAGACGTCTCCTGGGATGTTTAACTTTGACACTTTCTGGAA 
GAAT T T TAAAT CCAAGC TGGGT TT CATCAAC T GGGATGCCATAAACAAGGAC CAGAGAAGC TCTC 
GCATCCCG TGACC T C CAGACAAGGAGCCACCAGAT TGGATGGGAGC CCC CACAC TCCCTCC T TAA 
AACACCACCCTCTCATCACTAATCTCAGCCCTTGCCCTTGAAATAAACCTTAGCTGCCCCACAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^A^^ 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 52 

MKFQGPLACLLLALGLGSGEAGPLQSGEESTGTNI GEALGHGLGDALSEGVGKAIGKEAGGAAGSKVS 
EALGQGTREAVGTGVRQVPGFGAADALGNRVGEAAHALGNTGHE I GRQAEDVI RHGADAVRGSWQGVP 
GHSGAWETSGGHGIFGSQGGLGGQGQGNPGGLGTPWVHGYPGNSAGSFGMNPQGAPWGQGGNGGPPNF 
GTNTQGAVAQPGYGSVRASNQNEGCTNPPPSGSGGGSSNSGGGSGSQSGSSGSGSNGDNNNGSSSGGS 
SSGSSSGSSSGGSSGGSSGGSSGNSGGSRGDSGSESSWGSSTGSSSGNHGGSGGGNGHKPGCEKPGNE 
ARGSGESGIQGFRGQGVSSNMREISKEGNRLLGGSGDNYRGQGSSWGSGGGDAVGGVNTVNSETSPGM 
FNFDT FWKNFKS KLGF INWDAINKDQRS SRI P 



Signal peptide: 

amino acids 1-21 



N-glycosylation site. 

amino acids 265-269 



Glycosaminoglycan attachment site. 

amino acids 235-239, 237-241, 244-248, 255-259, 324-328, 388-392 

Casein kinase II phosphorylation site. 

amino acids 26-30, 109-113, 259-263, 300-304, 304-308 



N-myristoylation site. 

amino acids 17-23, 32-38, 42-48, 50-56, 60-66, 61-67, 64-70, 74-80, 

90-96, 96-102, 130-136, 140-146, 149-155, 152-158, 155-161, 

159-165, 163-169, 178-184, 190-196, 194-200, 199-205, 218-224, 

236-242, 238-244, 239-245, 240-246, 245-251, 246-252, 249-255, 

253-259, 256-262, 266-272, 270-276, 271-277, 275-281, 279-285, 

283-289, 284-290, 287-293, 288-294, 291-297, 292-298, 295-301, 

298-304, 305-311, 311-317, 315-321, 319-325, 322-328, 323-329, 

325-331, 343-349, 354-360, 356-362, 374-380, 381-387, 383-389, 
387-393, 389-395, 395-401 

Cell attachment sequence. 

amino acids 301-304 
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FIGURE 53 

GGAGAAGAGGTTGTGTGGGACAAGCTGCTCCCGACAGAAGGATGTCGCTGCTGAGCCTGCCCTGG 
CTGGGCCTCAGACCGGTGGCAATGTCCCCATGGCTACTCCTGCTGCTGGTTGTGGGCTCCTGGCT 
ACTCGCCCGCATCCTGGCTTGGACCTATGCCTTCTATAACAACTGCCGCCGGCTCCAGTGTTTCC 
CACAGCCCCCAAAACGGAACTGGTTTTGGGGTCACCTGGGCCTGATCACTCCTACAGAGGAGGGC 
TTGAAGGACTCGACCCAGATGTCGGCCACCTATTCCCAGGGCTTTACGGTATGGCTGGGTCCCAT 
CATCCCCTTCATCGTTTTATGCCACCCTGACACCATCCGGTCTATCACCAATGCCTCAGCTGCCA 
TTGCACCCAAGGATAATCTCTTCATCAGGTTCCTGAAGCCCTGGCTGGGAGAAGGGATACTGCTG 
AGTGGCGGTGACAAGTGGAGCCGCCACCGTCGGATGCTGACGCCCGCCTTCCATTTCAACATCCT 
GAAGTCCTATATAAGGATCT TCAACAAGAGTGCAAACATCATGCT TGACAAGTGGCAGCACCTGG 
CC T CAGAGGGCAGCAGTCGTCT GGACAT G TT T GAGCACATCAGC C T CAT GACCT TGGACAG TC T A 
C AGAAAT GCAT C T T CAG C T T T GAC AGC C AT T G T CAG GAGAG GC C C AG T GAAT AT AT T G CCACCAT 



CTTGGAGCTCAGTGCCCTTGTAGAGAAAAGAAGCCAGCATATCCTCCAGCACATGGACTTTCTGT 
ATTACCTCTCCCATGACGGGCGGCGCTTCCACAGGGCCTGCCGCCTGGTGCATGACTTCACAGAC 
GCTGTCATCCGGGAGCGGCGTCGCACCCTCCCCACTCAGGGTATTGATGATTTTTTCAAAGACAA 
AGCCAAGTCCAAGACTTTGGATTTCATTGATGTGCTTCTGCTGAGCAAGGATGAAGATGGGAAGG 
CAT T GTCAGAT GAGGATATAAGAGCAGAGGC TGACACC T TCAT GT T TGGAGGCCATGACACCACG 
GCCAGTGGCCTCTCCTGGGTCCTGTACAACCTTGCGAGGCACCCAGAATACCAGGAGCGCTGCCG 
ACAGGAGGTGCAAGAGCTTCTGAAGGACCGCGATCCTAAAGAGATTGAATGGGACGACCTGGCCC 
AGCTGCCCTTCCTGACCATGTGCGTGAAGGAGAGCCTGAGGTTACATCCCCCAGCTCCCTTCATC 
TCCCGATGCTGCACCCAGGACATTGTTCTCCCAGATGGCCGAGTCATCCCCAAAGGCATTACCTG 
CCTCATCGATATTATAGGGGTCCATCACAACCCAACTGTGTGGCCGGATCCTGAGGTCTACGACC 
CCT TCCGCTT TGACCCAGAGAACAGCAAGGGGAGGTCACCTC TGGCTTT TAT TCCT TTCTCCGCA 
GGGCCCAGGAACTGCATCGGGCAGGCGTTCGCCATGGCGGAGATGAAAGTGGTCCTGGCGTTGAT 
GCTGCTGCAC T TCCGGT TCC T GCCAGACCACACT GAGCCCCGCAGGAAGC TGGAAT T GAT CATGC 
GCGCCGAGGGCGGGCTTTGGCTGCGGGTGGAGCCCCTGAATGTAGGCTTGCAGTGACTTTCTGAC 
CCATCCACCTGTTTTTTTGCAGATTGTCATGAATAAAACGGTGCTGTCAAA 
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FIGURE 54 



MSLLSLPWLGLRPVAMSPWLLLLLWGSWLLARILAWTYAFYNNCRRLQCFPQPPKRNWFWGHLG 

LITPTEEGLKDSTQMSATYSQGFTVWLGPIIPFIVLCHPDTIRSITNASAAIAPKDNLFIRFLKP 

WLGEGILLSGGDKWSRHRRMLTPAFHFNILKSYITIFNKSANIMLDPCWQHLASEGSSRLDMFEHI 

S LMTLDS LQKC I FS FDSHCQERPSE Y I AT I LE LS ALVEKRS QH I LQHMDFL YYLSHDGRRFHRAC 

RLVHDFTDAVIRERRRTLPTQGIDDFFKDKAKSKTLDFIDVLLLSKDEDGKALSDEDIRAEADTF 

MFGGHDTTASGLSWLYNLARHPEYQERCRQEVQELLKDRDPKEIEWDDLAQLPFLTMCVKESLR 

LHPPAPFISRCCTQDIVLPDGRVIPKGITCLIDIIGVHHNPTVWPDPEVYDPFRFDPENSKGRSP 

LAFIPFSAGPRNCIGQAFAMAEMKVYLALMLLHFRFLPDHTEPRRKLELIM 

VGLQ 



Important features: 



Transmembrane domains: 



amino acids 13-32 (type II), 77-102 




Cytochrome P450 cysteine heme-iron ligand signature. 



amino acids 461-471 



N-glycosylation sites. 



amino acids 112-116, 168-172 
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FIGURE 55 



AT CGCAT CAATTGGGAGTACCAT C T TCC T CATGGGACCAGTGAAACAGC T GAAGCGAAT G T T TGA 
GCCTACTCGTTTGATTGCAACTATCATGGTGCTGTTGTGTTTTGCACTTACCCTGTGTTCTGCCT 
TTTGGTGGCATAACAAGGGACTTGCACTTATCTTCTGCATTTTGCAGTCTTTGGCATTGACGTGG 
TACAGCCTTTCCTTCATACCATTTGCAAGGGATGCTGTGAAGAAGTGTTTTGCCGTGTGTCTTGC 
ATAATT CATGGCCAG T TT TATGAAGCT T T GGAAGGCACTATGGACAGAAGC TGG T GGACAGTT T T 
GTAACTATCTTCGAAACCTCTGTCTTACAGACATGTGCCTTTTATCTTGCAGCAATGTGTTGCTT 
GTGATTCGAACATTTGAGGGTTACTTTTGGAAGCAACAATACATTCTCGAACCTGAATGTCAGTA 
GCACAGGATGAGAAGTGGGTTCTGTATCTTGTGGAGTGGAATCTTCCTCATGTACCTGTTTCCTC 
TC T GGAT GT T GT CCCAC T GAAT TCCCAT GAATACAAACC TAT T CAGCAACAGCAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 56 



MGPVKQLKRMFEPTRL IAT IMVLLC FAL TLC S AFW WHNKGLAL I FCI LQS LALTWYSLS FI PFAR 
DAVKKC FAVCLA 

Important features: 
Signal peptide: 

amino acids 1-33 

Type II fibronectin collagen-binding domain protein. 

amino acids 30-72 
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FIGURE 57 



CGGCTCGAGCTCGAGCCGAATCGGCTCGAGGGGCAGTGGAGCACCCAGCAGGCCGCCAACATGCTCTGTCTGTGCCTG 
TACGTGCCGGTCATCGGGGAAGCCCAGACCGAGTTCCAGTACTTTGAGTCGAAGGGGCTCCCTGCCGAGCTGAAGTCC 
ATTT T CAAGCT CAGT GT CTT CAT CC CCT CC CAGGAATT CT CCAC CTACC GCCAGT GGAAGCAGAAAATT GTACAAGCT 
GGAGATAAGGACCTTGATGGGCAGCTAGACTTTGAAGAATTTGTCCATTATCTCCAAGATCATGAGAAGAAGCTGAGG 
CTGGTGTTTAAGATTTTGGACAAAAAGAATGATGGACGCATTGACGCGCAGGAGATCATGCAGTCCCTGCGGGACTTG 
GGAGT CAAGATAT CT GAACAGCAGGCAGAAAAAATTCT CAAGAGCAT GGATAAAAACGG CACGAT GAC CATCGACTGG 
AACGAGT GGAGAGACTACCACCT CCTCCAC CCC GT GGAAAACAT CCCC GAGATCATCCT CTACT GGAAGCAT T CCACG 
ATCTTTGATGTGGGTGAGAATCTAACGGTCCCGGATGAGTTCACAGTGGAGGAGAGGCAGACGGGGATGTGGTGGAGA 
CAC CT GGT G GC AG GAGGT GGGG CAG GGGCC GT AT C CAGAACCT GC AC GGCCC CCCTGGACAGGCT CAAGGTG CTCAT G 
CAGGT CCAT GCCT CCCGCAGCAACAACAT GGGCAT CGTT GGT GGCTTCACTCAGATGATT C GAGAAGGAGGGGCCAGG 
T CACT CT GG CGG GGCAATGGCATCAACGTC CT CAAAATT GCC CCCGAATCAGCCATCAAATTCAT GGC CTAT GAGCAG 
AT CAAGCGC CTT GTT GGTAGTGACCAGGAGACT CT GAGGATTCAC GAGAGGCTT GTGG CAGGGTCCTT GGCAGGGGCC 
ATCGCCCAGAGCAGCATCTACCCAATGGAGGTCCTGAAGACCCGGATGGCGCTGCGGAAGACAGGCCAGTACTCAGGA 
AT GCT GGACT GC GCCAGGAGGATCCTGGCCAGAGAGGGGGTGGCC GC CTT CTACAAAGGCTAT GT CCCCAACAT GCT G 

GGCATCATCCCCTATGCCGGCATCGACCTTGCAGTCTACGAGACGCTCAAGAATGCCTGGCTGCAGCACTATGCAGTG 
AACAGCGCGGACCCCGGCGTGTTTGTGCTCCTGGCCTGTGGCACCATGTCCAGTACCTGTGGCCAGCTGGCCAGCTAC 
CC CCTGGCC CTAGT CAGGACCC GGATGCAGGCGCAAGCCT CTAT T GAGGG CGCT CCG GAGGT GACCAT GAGCAGCCT C 
TTCAAACATATCCTGCGGACCGAGGGGGCCTTCGGGCTGTACAGGGGGCTGGCCCCCAACTTCATGAAGGTCATCCCA 
GCTGTGAGCATCAGCTACGTGGTCTACGAGAACCTGAAGATCACCCTGGGCGTGCAGTCGCG GTGAC GGGGGGAGGGC 
CGCCCGGCAGTGGACTCGCTGATCCTGGGCCGCAGCCTGGGGTGTGCAGCCATCTCATTCTGTGAATGTGCCAACACT 
AAGCT GT CT CGAGC CAAGCT GT GAAAACCCTAGACGCAC CCGCAGGGAGGGT GGGGAGAGCT GGCAGGCCCAGGGCT T 
GT CCT GCT GACC CCAGCAGACCCT C CT GTT GGTT CCAGC GAAGACCACAG GCATTCCT TAGGGTCCAGGGT CAGCAGG 
CT CCGGGCT CACAT GTGTAAGGACAGGACATT TT CT GCAGTGC CT GC CAATAGT GAG CTT GGAGCCT GGAGGC CG GCT 

TAGTTCTTCCATTTCACCCTTGCAGCCAGCTGTTGGCCACGGCCCCTGCCCTCTGGTCTGCCGTGCATCTCCCTGTGC 
CCTCTTGCTGCCTGCCTGTCTGCTGAGGTAAGGTGGGAGGAGGGCTACAGCCCACATCCCACCCCCTCGTCCAATCCC 
ATAAT CCAT GAT GAAAGGT GAGGT CACGTGGCCTCCCAGGCCTGACTT CCCAACCTACAGCATTGACGCCAACTTGGC 
TGTGAAGGAAGAGGAAAGGATCTGGCCTTGTGGTCACTGGCATCTGAGCCCTGCTGATGGCTGGGGCTCTCGGGCATG 
CTTGGGAGTGCAGGGGGCTCGGGCTGCCTGGCCTGGCTGCACAGAAGGCAAGTGCTGGGGCTCATGGTGCTCTGAGCT 
GGCCT GGACCCT GT CAG GAT GGGCC CCAC CTCAGAAC CAAACT CACT GTC CCCACTGT GGCAT GAGGGCAGT GGAGCA 
CCATGTTTGAGGGCGAAGGGCAGAGCGTTTGTGTGTTCTGGGGAGGGAAGGAAAAGGTGTTGGAGGCCTTAATTATGG 
ACTGT TGG GAAAAGGGT T TT GTC CAGAAGGACAAGCC G GACAAAT GAGCGACT T CT GT GCTTC CAGAGGAAGACGAGG 
GAGCAGGAGCTTGGCTGACTGCTCAGAGTCTGTTCTGACGCCCTGGGGGTTCCTGTCCAACCCCAGCAGGGGCGCAGC 
GGGACCAGC CCCACATT CCACT T GT GT CACTGCTT GGAACCT AT TTATTT TGT AT TTATT TGAACAGAGT TAT GT CCT 
AACTATTTTT AT AGATT T GTTTAAT TAATAGCTT GTCATT TT CAAGTT CATTT T TTAT T CATATT TAT GT T CAT GGT T 

GATTGTACCTTCCCAAGCCCGCCCAGTGGGATGGGAGGAGGAGGAGAAGGGGGGCCTTGGGCCGCTGCAGTCACATCT 
GTCCAGAGAAATTCCTTTTG GGACT GGAGGCAGAAAAGCGGCCAGAAGGCAGCAGCCCTGGCTCCTTTCCTTTGGCAG 
GTTGGGGAAGGGCTTGCCCCCAGCCTTAGGATTTCAGGGTTTGACTGGGGGCGTGGAGAGAGAGGGAGGAACCTCAAT 
AACCTTGAAGGTGGAATCCAGTTATTTCCTGCGCTGCGAGGGTTTCTTTATTTCACTCTTTTCTGAATGTCAAGGCAG 
TGAGGTGCCTCTCACTGTGAATTTGTGGTGGGCGGGGGCTGGAGGAGAGGGTGGGGGGCTGGCTCCGTCCCTCCCAGC 
CTTCTGCTGCCCTTGCTTAACAATGCCGGCCAACTGGCGACCTCACGGTTGCACTTCCATTCCACCAGAATGACCTGA 
TGAGGAAATCTTCAATAGGATGCAAAGATCAATGCAAAAATTGTTATATATGAACATATAACTGGAGTCGTCAAAAAG 
CAAAT TAAGAAAGAATT GGAC GTTAGAAGTT GT CATTTAAAGCAGCCTT CTAATAAAGTT GT TTCAAAGCTGAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 58 

MLCLCL YVPVI GEAQTE FQYFE SKGLPAELKS I FKLSVFI PSQE FS T YRQWKQKI VQAGDKDLDG 
QLDFEEFVHYLQDHEKKLRLVFKILDKKNDGRIDAQEIMQSLRDLGVKISEQQAEKILKSMDKNG 
TMT I DWNEWRDYHLLHPVENI PEIILYWKHS T I FDVGENL TVPDE FTVEERQTGMWWRHL VAGGG 
AGAVS RT C T APLDRLKVLMQVHAS RSNNMG I VGG FT QM I RE GGARS LWRGNG I NVLK I APE S A I K 
FMAYEQ IKRL VGS DQE TLRI HERLVAGSLAGAI AQS S I YPME VLKTRMALRKTGQYS GMLDCARR 
I LARE GVAAFYKGYVPNMLGIIPYAG I DLAVYE TLKNAWLQHYAVNS ADPGVFVLLACGTMS S TC 
GQLASYPLALVRTRMQAQAS IEGAPEVTMSSLFKHI LRTEGAFGLYRGLAPNFMKVT PAVS ISYV 
VYENLKI TLGVQSR 

Important features: 
Signal peptide: 

amino acids 1-16 

Putative transmembrane domains: 

amino acids 284-304, 339-360, 376-394 

Mitochondrial energy transfer proteins signature. 

amino acids 206-215, 300-309 

N-glycosylation sites. 

amino acids 129-133, 169-173 

Elongation Factor-hand calcium-binding protein. 

amino acids 54-73, 85-104, 121-140 
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FIGURE 59 



GGAAGGCAGCGGCAGCTCCACTCAGCCAGTACCCAGATACGCTGGGAACCTTCCCCAGCC ATG GC 
TTCCCTGGGGCAGATCCTCTTCTGGAGCATAATTAGCATCATCATTATTCTGGCTGGAGCAATTG 
CAC TCAT CAT TGGC TTTGGTATT T CAGGGAGACACTCCATCACAG T CAC TACTGTCGCCTCAGC T 
GGGAACAT T GGGGAGGAT GGAATC C TGAGCT GCACTT TT GAACCT GACAT CAAACT TT CT GATAT 
CGT GATACAAT GGCT GAAGGAAGGT GT T T TAGGCT T GGTCCAT GAGT TCAAAGAAGGCAAAGAT G 
AGCTGTCGGAGCAGGAT GAAATGT T CAGAGGCCGGACAGCAGT GT T T GC TGAT CAAGT GATAGT T 
GGCAATGCCTCTTT GCGGCTGAAAAACGTGCAACT CACAGATGCT GGCACCTACAAAT GT TATAT 
CAT CAC T TC TAAAGGCAAGGGGAAT GCTAACCT TGAGTATAAAAC TGGAGCCT TCAGCAT GCCGG 
AAGTGAATGTGGACTATAATGCCAGCTCAGAGACCTTGCGGTGTGAGGCTCCCCGATGGTTCCCC 
CAGCCCACAGTGGTCTGGGCATCCCAAGTTGACCAGGGAGCCAACTTCTCGGAAGTCTCCAATAC 
CAGCTTTGAGCTGAACTCTGAGAATGTGACCATGAAGGTTGTGTCTGTGCTCTACAATGTTACGA 
TCAACAACACATACT CC T GTATGAT T GAAAATGACAT T GCCAAAGCAACAGGGGAT AT CAAAGT G 
ACAGAATCGGAGATCAAAAGGCGGAGTCACCTACAGCTGCTAAACTCAAAGGCTTCTCTGTGTGT 
CTCTTCTTTCTTTGCCATCAGCTGGGCACTTCTGCCTCTCAGCCCTTACCTGATGCTAAAATAAT 
GTGCCT TGGCCACAAAAAAGCAT GCAAAGTCAT TGT TACAACAGGGATC TACAGAACT AT T TCAC 
CACCAGATATGACC TAG T T T TAT AT TTC T GGGAGGAAAT GAAT T CAT AT C TAGAAGT CTGGAGT G 
AGCAAACAAGAGCAAGAAACAAAAAGAAGCCAAAAGCAGAAGGCTCCAATATGAACAAGATAAAT 
CTATCT TCAAAGACATATTAGAAGT TGGGAAAATAAT T CATGT GAAC TAGACAAG TGTGTTAAGA 
GTGATAAGTAAAATGCACGTGGAGACAAGTGCATCCCCAGATCTCAGGGACCTCCCCCTGCCTGT 
CACCTGGGGAGTGAGAGGACAGGATAGTGCATGTTCTTTGTCTCTGAATTTTTAGTTATATGTGC 
TGTAATGT T GCTC T GAGGAAGCCC C TGGAAAGTCTATCCCAACATAT CCACATCT TATAT T CCAC 
AAATTAAGCTGTAGTATGTACCCT71AGACGCTGCTAATTGACTGCCACTTCGCAACTCAGGGGCG 
GCTGCATTTTAGTAATGGGTCAAATGATTCACTTTTTATGATGCTTCCAAAGGTGCCTTGGCTTC 
TCT TCCCAAC T GACAAAT GCCAAAG TTGAGAAAAAT GATCATAAT T T TAGCATAAACAGAGCAGT 
CGGGGACACCGATT T TATAAATAAACTGAGCACCTTCT TT TTAAACAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 60 

MASLGQILFWSIISIIIILAGAIALIIGFGISGRHSITVTTVASAGNIGEDGILSCTFEPDIKLS 
DIVIQWLKEGVLGLVHEFKEGKDELSEQDEMFRGRTAVFADQVIVGNASLRLKNVQLTDAGTYKC 
YIITSKGKGNANLE YKTGAFSMPEVNVDYNAS SET LRCEAPRW FPQPT WWAS QVDQGANFSEVS 
NTSFELNSENVTMKWSVLYNVTINNTYSCMIENDIAKATGDIKVTESEIKRRSHLQLLNSKASL 
CVS S FFAI S WALLPLS PYLMLK 

Important features: 
Signal peptide: 

amino acids 1-28 

Transmembrane domain: 

amino acids 258-281 

N-glycosylation sites. 

amino acids 112-116, 160-164, 190-194, 196-200, 205-209, 216-220, 
220-224 

N-myristoylation sites. 

amino acids 52-58, 126-132, 188-194 
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FIGURE 61 



TGACGTCAGAATCACCATGGCCAGCTATCCTTACCGGCAGGGCTGCCCAGGAGCTGCAGGACAAG 
CACCAGGAGCCCCTCCGGGTAGCTACTACCCTGGACCCCCCAATAGTGGAGGGCAGTATGGTAGT 
GGGCTACCCCCTGGTGGTGGTTATGGGGGTCCTGCCCCTGGAGGGCCTTATGGACCACCAGCTGG 
TGGAGGGCCCTATGGACACCCCAATCCTGGGATGTTCCCCTCTGGAACTCCAGGAGGACCATATG 
GCGGTGCAGCTCCCGGGGGCCCCTATGGTCAGCCACCTCCAAGTTCCTACGGTGCCCAGCAGCCT 
GGGCTTTATGGACAGGGTGGCGCCCCTCCCAATGTGGATCCTGAGGCCTACTCCTGGTTCCAGTC 
GGT GGACTCAGATCAC AGTGGC TAT ATC TCCAT GAAGGAGC TAAAGCAGGCCCT GGTCAAC TGCA 
ATTGGTCTTCATTCAATGATGAGACCTGCCTCATGATGATAAACATGTTTGACAAGACCAAGTCA 
GGCCGCATCGATGTCTACGGCTTCTCAGCCCTGTGGAAATTCATCCAGCAGTGGAAGAACCTCTT 
CCAGCAGTATGACCGGGACCGCTCGGGCTCCATTAGCTACACAGAGCTGCAGCAAGCTCTGTCCC 
AAATGGGCTACAACCTGAGCCCCCAGTTCACCCAGCTTCTGGTCTCCCGCTACTGCCCACGCTCT 
GCCAATCCTGCCATGCAGCTTGACCGCTTCATCCAGGTGTGCACCCAGCTGCAGGTGCTGACAGA 
GGCCTTCCGGGAGAAGGACACAGCTGTACAAGGCAACATCCGGCTCAGCTTCGAGGACTTCGTCA 
CCAT GACAGC T TCT CGGATGCTATGACCCAACCAT CTGTGGAGAGT GGAGTGCACCAGGGACC T T 
T CCTGGCT TC T TAGAGT GAGAGAAGTATGTGGACAT CT C T TC T T T T CC T GTCCC TCTAGAAGAAC 
ATTCTCCCTTGCTTGATGCAACACTGTTCCAAAAGAGGGTGGAGAGTCCTGCATCATAGCCACCA 
AATAGTGAGGACCGGGGCTGAGGCCACACAGATAGGGGCCTGATGGAGGAGAGGATAGAAGTTGA 
ATGTCCTGATGGCCATGAGCAGTTGAGTGGCACAGCCTGGCACCAGGAGCAGGTCCTTGTAATGG 
AGTTAGTGTCCAGTCAGCTGAGCTCCACCCTGATGCCAGTGGTGAGTGTTCATCGGCCTGTTACC 
GTTAGTACCTGTGTTCCCTCACCAGGCCATCCTGTCAAACGAGCCCATTTTCTCCAAAGTGGAAT 
CTGACCAAGCATGAGAGAGATCTGTCTATGGGACCAGTGGCTTGGATTCTGCCACACCCATAAAT 
CCTTGTGTGTTAACTTCTAGCTGCCTGGGGCTGGCCCTGCTCAGACAAATCTGCTCCCTGGGCAT 
CTTTGGCCAGGCTTCTGCCCCCTGCAGCTGGGACCCCTCACTTGCCTGCCATGCTCTGCTCGGCT 
TCAGTCTCCAGGAGACAGTGGTCACCTCTCCCTGCCAATACTTTTTTTAATTTGCATTTTTTTTC 
ATT TGGGGCCAAAAGTC CAG TGAAAT TGTAAG CT TCAATAAAAGGAT GAAAC T C TGA 
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FIGURE 62 



MASYPYRQGCPGAAGQAPGAPPGSYYPGPPNSGGQYGSGLPPGGGYGGPAPGGPYGPPAGGGPYG 
HPNPGMFPSGTPGGPYGGAAPGGPYGQPPPSSYGAQQPGLYGQGGAPPNVDPEAYSWFQSVDSDH 
S G Y I SMKELKQALWCNW S S FNDE T CLMM I NMFDKTKS GR I DVYG FSAL WKF I QQWKNL FQQ YDR 
DRSGSISYTELQQALSQMGYNLSPQFTQLLVSRYCPRSANPAMQLDRFIQVCTQLQVLTEAFREK 
DTAVQGN I RL S FE D FVTMT ASRML 

Important features of the protein: 
Signal peptide: 
amino acids 1-19 

N-glycosylation site. 

amino acids 147-150 

Casein kinase II phosphorylation sites. 

amino acids 135-138, 150-153, 202-205, 271-274 

N-myristoylation sites. 

amino acids 9-14, 15-20, 19-24, 33-38, 34-39, 39-44, 43-48, 61-66, 
70-75, 78-83, 83-88, 87-92, 110-115 
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FIGURE 63 



CAGGATGCAGGGCCGCGTGGCAGGGAGCTGCGCTCCTCTGGGCCTGCTCCTGGTCTGTCTTCATC 
TCCCAGGCCTCTTTGCCCGGAGCATCGGTGTTGTGGAGGAGAAAGTTTCCCAAAACTTCGGGACC 
AACTTGCCTCAGCTCGGACAACCTTCCTCCACTGGCCCCTCTAACTCTGAACATCCGCAGCCCGC 
TCTGGACCCTAGGTCTAATGACTTGGCAAGGGTTCCTCTGAAGCTCAGCGTGCCTCCATCAGATG 
GCTTCCCACCTGCAGGAGGTTCTGCAGTGCAGAGGTGGCCTCCATCGTGGGGGCTGCCTGCCATG 
GATTCCTGGCCCCCTGAGGATCCTTGGCAGATGATGGCTGCTGCGGCTGAGGACCGCCTGGGGGA 
AGCGCTGCCTGAAGAACTCTCTTACCTCTCCAGTGCTGCGGCCCTCGCTCCGGGCAGTGGCCCTT 
TGCCTGGGGAGTCTTCTCCCGATGCCACAGGCCTCTCACCTGAGGCTTCACTCCTCCACCAGGAC 
TCGGAGTCCAGACGACTGCCCCGTTCTAATTCACTGGGAGCCGGGGGAAAAATCCTTTCCCAACG 
CCCTCCCTGGTCTCTCATCCACAGGGTTCTGCCTGATCACCCCTGGGGTACCCTGAATCCCAGTG 
TGTCCTGGGGAGGTGGAGGCCCTGGGACTGGTTGGGGAACGAGGCCCATGCCACACCCTGAGGGA 
ATCTGGGGTATCAATAATCAACCCCCAGGTACCAGCTGGGGAAATATTAATCGGTATCCAGGAGG 
CAGCTGGGGAAATATTAATCGGTATCCAGGAGGCAGCTGGGGGAATATTAATCGGTATCCAGGAG 
GCAGCTGGGGGAATATTCATCTATACCCAGGTATCAATAACCCATTTCCTCCTGGAGTTCTCCGC 
CCTCCTGGCTCTTCTTGGAACATCCCAGCTGGCTTCCCTAATCCTCCAAGCCCTAGGTTGCAGTG 
GGGCTAGAGCACGATAGAGGGAAACCCAACATTGGGAGTTAGAGTCCTGCTCCCGCCCCTTGCTG 
TGTGGGCTCAATCCAGGCCCTGTTAACATGTTTCCAGCACTATCCCCACTTTTCAGTGCCTCCCC 
TGCTCATCTCCAATAAAATAAAAGCACT^^ 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^^ 
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FIGURE 64 



MQGRVAGSCAPLGLLLVCLHLPGLFARSIGWEEKVSQNFGTNLPQLGQPSSTGPSNSEHPQPAL 
DPRSNDLARVPLKLSVPPSDGFPPAGGSAVQRWPPSWGLPAMDSWPPEDPWQMMAAAAEDRLGEA 
LPEELSYLSSAAALAPGSGPLPGESSPDATGLSPEASLLHQDSESRRLPRSNSLGAGGKILSQRP 
PWSLIHRVLPDHPWGTLNPSVSWGGGGPGTGWGTRPMPHPEGIWGINNQPPGTSWGNINRYPGGS 
WGNINRYPGGSWGNINRYPGGSWGNIHLYPGINNPFPPGVLRPPGSSWNIPAGFPNPPSPRLQWG 

Important features of the protein: 
Signal peptide: 

amino acids 1-26 

Casein kinase II phosphorylation sites. 

amino acids 56-59, 155-158 




N-myristoylation sites. 

amino acids 48-53, 220-225, 221-226, 224-229, 247-252, 258-263, 
259-264, 269-274, 270-275, 280-285, 281-286, 305-310 
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FIGURE 65 

AAGGAGAGGCCACCGGGACTTCAGTGTCTCCTCCATCCCAGGAGCGCAGTGGCCACT ATG GGGTC 
TGGGCTGCCCCTTGTCCTCCTCTTGACCCTCCTTGGCAGCTCACATGGAACAGGGCCGGGTATGA 
CTTTGCAACTGAAGCTGAAGGAGTCTTTTCTGACAAATTCCTCCTATGAGTCCAGCTTCCTGGAA 
TTGCTTGAAAAGCTCTGCCTCCTCCTCCATCTCCCTTCAGGGACCAGCGTCACCCTCCACCATGC 
AAGATCTCAACACCATGTTGTCTGCAACACATGACAGCCATTGAAGCCTGTGTCCTTCTTGGCCC 
GGGCTTTTGGGCCGGGGATGCAGGAGGCAGGCCCCGACCCTGTCTTTCAGCAGGCCCCCACCCTC 
C T GAG T G GCAATAAATAAAAT T C GG T AT GC T G 
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FIGURE 66 

MGSGLPLVLLLTLLGSSHGTGPGMTLQLKLKESFLTNSSYESSFLELLEKLCLLLHLPSGTSVTL 
HHARS QHHWCNT 

Important features: 
Signal peptide: 

amino acids 1-19 

N-glycosylation site. 

amino acids 37-41 

N-myristoylation sites. 

amino acids 15-21, 19-25, 60-66 
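
The N-glycosylation position listed above can be cross-checked against the printed FIGURE 66 sequence. The following is only a minimal sketch, not the procedure used to prepare the figure. It assumes the annotation follows the common N-{P}-[ST]-{P} consensus pattern and that each listed range runs from the first motif residue to one position past the last. The names FIG66, NGLYC and n_glyc_sites are hypothetical and introduced here for illustration.

```python
import re

# Protein sequence as printed in FIGURE 66, with OCR whitespace removed.
FIG66 = (
    "MGSGLPLVLLLTLLGSSHGTGPGMTLQLKLKESFLTNSSYESSFLELLEKLCLLLHLPSGTSVTL"
    "HHARSQHHWCNT"
)

# Assumed consensus pattern: N, any residue except P, S or T, any residue except P.
# The lookahead lets overlapping motifs be reported separately.
NGLYC = re.compile(r"(?=(N[^P][ST][^P]))")


def n_glyc_sites(seq):
    """Return 1-based (start, end) pairs for each motif hit.

    `end` is one position past the last motif residue, which is the
    convention the figure's ranges appear to follow (an assumption).
    """
    return [(m.start() + 1, m.start() + 5) for m in NGLYC.finditer(seq)]


if __name__ == "__main__":
    print(n_glyc_sites(FIG66))  # prints [(37, 41)]
```

Under these assumptions the single hit, NSSY starting at residue 37, agrees with the "amino acids 37-41" entry listed for FIGURE 66.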
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FIGURE 67 



ACGGACCGAGGGT T CGAGGGAGGGACACGGAC CAGGAACC T GAGC TAGGT CAAAGACGCCCGGGC 
CAGGTGCCCCGTCGCAGGTGCCCCTGGCCGGAGATGCGGTAGGAGGGGCGAGCGCGAGAAGCCCC 
TTCCTCGGCGCTGCCAACCCGCCACCCAGCCC ATG GCGAACCCCGGGCTGGGGCTGCTTCTGGCG 
CTGGGCCTGCCGTTCCTGCTGGCCCGCTGGGGCCGAGCCTGGGGGCAAATACAGACCACTTCTGC 
AAATGAGAATAGCACTGTTTTGCCTTCATCCACCAGCTCCAGCTCCGATGGCAACCTGCGTCCGG 
AAGCCATCACTGCTATCATCGTGGTCTTCTCCCTCTTGGCTGCCTTGCTCCTGGCTGTGGGGCTG 
GCACTGTTGGTGCGGAAGCTTCGGGAGAAGCGGCAGACGGAGGGCACCTACCGGCCCAGTAGCGA 
GGAGCAGTTCTCCCATGCAGCCGAGGCCCGGGCCCCTCAGGACTCCAAGGAGACGGTGCAGGGCT 
GCCTGCCCATC TAG GTCCCCTCTCCTGCATCTGTCTCCCTTCATTGCTGTGTGACCTTGGGGAAA 
GGCAGT GCCC T CT C T GGGCAGT CAGATC CACCCAGT GC T T AATAGCAGGGAAGAAGGTACT TCAA 
AGACTCTGCCCCTGAGGTCAAGAGAGGATGGGGCTATTCACTTTTATATATTTATATAAAATTAG 
TAGT GAGAT GT AAAAAAAAAAAAAAAAAA 
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FIGURE 68 



MANPGLGLLLALGLPFLLARWGRAWGQ IQTT SANENS TVLPS S TS S SS DGNLRPEAI TAI IWFS 
LLAALLLAVGLALLVRKLREKRQTEGTYRPSSEEQFSHAAEARAPQDSKETVQGCLPI 

Important features: 
Signal peptide: 

amino acids 1-19 

Transmembrane domain: 

amino acids 56-80 

N-glycosylation site. 

amino acids 36-40 



cAMP- and cGMP-dependent protein kinase phosphorylation site. 




amino acids 86-90 

Tyrosine kinase phosphorylation site. 

amino acids 86-94 



N-myristoylation sites. 



amino acids 7-13, 26-32 
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FIGURE 69 



GCCAGGAATAACTAGAGAGGAACAATGGGGTTATTCAGAGGTTTTGTTTTCCTCTTAGTTCTGTGCCTGCTGCACCAG 
TCAAATACTT CCT T CAT TAAGCT GAATAATAAT GGCT TT GAAGATAT T GT CATT GTTATAGAT C CTAGT GT GCCAGAA 
GATGAAAAAATAATT GAACAAATAGAGGATAT GGT GACTACAG CT TCTAC GTACCTGT TTGAAGCCACAGAAAAAAGA 
TTTT TTT T CAAAAAT GT AT CTATATTAATT CCT GAGAATT GGAAGGAAAAT CCT CAGTACAAAAGGCCAAAACAT GAA 
AAC CATAAACAT GCT GAT GTTATAGTT GCACCACCTACACTC CCAGGTAGAGAT GAAC CATACACCAAGCAGT TCACA 
GAAT GT GGAGAGAAAGGC GAATACATT CACTT CACCC CTGAC CT T CTACTT GGAAAAAAACAAAAT GAATATGGACCA 
CCAGGCAAACTGT TT GT C CAT GAGTGGGCT CAC CT CCGGT GGG GAGT GTTT GAT GAGTACAAT GAAGATCAGC CT TT C 
TAC CGT GCTAAGT CAAAAAAAAT C GAAGCAACAAGGT GTT CC G CAGGTAT CTCT GGTAGAAATAGAGTTTATAAGT GT 
CAAGGAGGCAGCTGT CTTAGTAGAGCAT GCAGAATTGATT CT ACAACAAAACT GTAT G GAAAAGATT GT CAAT T CTTT 
CCT GATAAAGTACAAACAGAAAAAGCAT C CATAAT GT TTATGCAAAGTATT GATT CT GTTGTT GAAT TTT GTAACGAA 
AAAAC C CATAAT CAAGAAGCT C CAAGC CTACAAAACATAAAGT GCAATT TTAGAAGTACAT GG GAGGT GATTAGGAAT 
TCTGAGGATTTTAAAAACACCATACCCATGGTGACACCACCTCCTCCACCTGTCTTCTCATTGCTGAAGATCAGTCAA 
AGAATTGT GT GCT TAGTT CTTGATAAGT CT GGAAGCAT GGGGGGTAAGGACCGCCTAAAT C GAAT GAATCAAGCAGCA 
AAACATT T CCT GCT GCAGACT GT T GAAAAT GGAT C CT GGGTGG GGAT GGTTCACT TT GATAGTACTGC CACTATT GTA 

AATAAGCTAATCCAAATAAAAAGCAGT GAT GAAAGAAACACACT CAT GGCAGGAT TAC CTACATATCCTCT GGGAGGA 
ACTTCCAT CT GCT CT GGAATTAAATAT GCATTT CAGGTGATT G GAGAGCTACATT CCCAACT C GAT GGAT CC GAAGTA 
CT GCT GCT GACT GAT GGGGAGGATAACACT GCAAGTT CTT GTATT GAT GAAGT GAAACAAAGT GGGGC CATT GTT CAT 
TTTATTGCTTTGGGAAGAGCT G CT GAT GAAGCAGTAATAGAGAT GAGCAAGATAACAG GAGGAAGT CAT TT TTAT GTT 
TCAGATGAAGCTCAGAACAATGGCCTCATTGATGCTTTTGGGGCTCTTACATCAGGAAATACTGATCTCTCCCAGAAG 
TCCCTTCAGCTCGAAAGTAAGGGATTAACACTGAATAGTAATGCCTGGATGAACGACACTGTCATAATTGATAGTACA 
GTGGGAAAGGACACGTTCTTTCTCATCACATGGAACAGTCTGCCTCCCAGTATTTCTCTCTGGGATCCCAGTGGAACA 
ATAAT GGAAAAT TT CACAGT GGAT GCAACTTCCAAAAT GGCCTAT CT CAGTATTCCAGGAACT GCAAAGGTGG GCACT 
TGGGCATACAAT CTT CAAGC CAAAGCGAAC CCAGAAACATTAACTATTACAGTAACTT CTCGAGCAGCAAATTC T TCT 

GTGCCTC CAATCACAGT GAATGCTAAAAT GAATAAGGACGTAAACAGTTT CCCCAGCCCAAT GAT T GTTTAC GCAGAA 
ATT CTACAAGGATAT GTACCTGT T CTT GGAGCCAAT GT GACT G CTTT CATT GAAT CACAGAAT GGACATACAGAAGTT 
TTGGAACTTTTGGATAATGGTGCAGGCGCTGATTCTTTCAAGAATGATGGAGTCTACTCCAGGTATTTTACAGCATAT 
ACAGAAAATGGCAGATATAGCTTAAAAGTT CGG GCTCAT GGAG GAG CAAACACT GCCAGGCTAAAAT TACGGCCT CCA 

CTGAATAGAGCCGC GTACATAC CAGGCT GGGTAGT GAACGGGGAAAT T GAAGCAAAC CCGCCAAGACCTGAAATT GAT 
GAGGATACT CAGAC CACCT T GGAGGAT TT CAGC CGAACAGCATC CG GAGGT GCATTT GTGGTATCACAAGTCCCAAGC 
CTTCC CT T GC CT GACCAATACC CACCAAGT CAAAT CACAGAC CTT GAT GC CACAGTT CAT GAGGATAAGATTAT T CTT 
ACAT GGACAG CACCAGGAGATAAT TTT GAT GT T GGAAAAGTT CAACGTT ATAT CATAAGAATAAGTGCAAGTATT CTT 

GAT CTAAGAGACAGT TT T GAT GAT GCT CTT CAAGTAAATACT AC T GAT CT GT CAC CAAAGGAG GCCAACT CC AAGGAA 
AGCTTTGCATTTAAACCAGAAAATATCTCAGAAGAAAATGCAACCCACATATTTATTGCCATTAAAAGTATAGATAAA 
AGCAATT T GACAT CAAAAGTAT CCAACATT GCACAAGTAACTTT GTTTAT C CCTCAAGCAAAT C CT GATGACATTGAT 
CCTACACCTACTCCTACTCCTACTCCTACTCCTGATAAAAGTCATAATTCTGGAGTTAATATTTCTACGCTGGTATTG 
TCT GT GATT G GGT CT GT T GTAATT GTTAACTTT ATTTTAAGT ACCAC CATTTGAACCTTAACGAAGAAAAAAATCTTC 
AAGTAGACCTAGAAGAGAGTTTTAAAAAACAAAACAAT GTAAGTAAAGGATATTT CT GAATCTTAAAATT CAT CCCAT 
GT GTGATCATAAACT CATAAAAATAATTTTAAGAT GT CGGAAAAGGATACTT T GATTAAATAAAAACACT CAT GGATA 
TGTAAAAACT GT CAAGATTAAAATTTAATAGT TT CATT TATTT GT TATTTTATTT GTAAGAAATAGT GAT GAACAAAG 
ATC CTTTTT CATACT GATAC CT GGTTGTATAT TATTT GAT GCAACAGT TTTCT GAAAT GATATTT CAAATT GCAT CAA 
GAAATTAAAATCAT CTAT CT GAGTAGT CAAAATACAAGTAAAGGAGAGCAAATAAACAACATTTGGAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 70 



MGLFRGFVFLLVLCLLHQSNTSFIKLNNNGFEDIVIVIDPSVPEDEKIIEQIEDMVTTASTYLFE 
ATEKRFFFKNVS I L I PENWKENPQYKRPKHENHKHADVI VAPPTLPGRDEPYTKQFTECGEKGE Y 
IHFTPDLLLGKKQNEYGPPGKLFVHEWAHLRWGVFDEYNEDQPFYRAKSKKIEATRCSAGISGRN 
RVYKCQGGSCLSRACRIDSTTKLYGKDCQFFPDKVQTEKASIMFMQSIDSWEFCNEKTHNQEAP 
SLQNIKCNFRSTWEVISNSEDFKNTIPMVTPPPPPVFSLLKISQRIVCLVLDKSGSMGGKDRLNR 
MNQAAKHFLLQTVENGSWGMVHFDSTATIWKLIQIKSSDERNTLMAGLPTYPLGGTSICSGIK 
YAFQVIGELHSQLDGSEVLLLTDGEDNTASSCIDEVKQSGAIVHFIALGRAADEAVIEMSKITGG 
SHFYVSDEAQNNGLIDAFGALTSGNTDLSQKSLQLESKGLTLNSNAWMNDTVIIDSTVGKDTFFL 
ITWNSLPPSISLWDPSGTIMENFTVDATSKMAYLSIPGTAKVGTWAYNLQAKANPETLTITVTSR 
AANSSVPPITWAKMNKDWSFPSPMIVYAEILQGWPVLGANVTAFIESQNGHTEVLELLDNGA 
GADSFKNDGVYSRYFTAYTENGRYSLKVRAHGGANTARLKLRPPLNRAAYIPGWVVNGEIEANPP 
RPEIDEDTQTTLEDFSRTASGGAFWSQVPSLPLPDQYPPSQITDLDATVHEDKIILTWTAPGDN 
FDVGKVQRYI IRI SAS I LDLRDS FDDALQVNT TDLS PKEANSKES FAFKPENI SEENATHI FI AI 
KSIDKSNLTSKVSNIAQVTLFIPQANPDDIDPTPTPTPTPTPDKSHNSGVNISTLVLSVIGSWI 
VNFILSTTI 

Signal peptide: 

amino acids 1-21 

Putative transmembrane domains: 

amino acids 284-300, 617-633 



Leucine zipper pattern. 

amino acids 469-491, 476-498 

N-glycosylation site. 

amino acids 20-24, 75-79, 340-344, 504-508, 542-546, 588-592, 
628-632, 811-815, 832-836, 837-841, 852-856, 896-900 



APP ID=10063568 



Page 220 of 318 



FIGURE 71 



CT CC TT AGGT GGAAACCC TGGGAGT AGAGTACT GACAGCAAAGACCGGGAAAGAC CATACGT CC CCGGGCAGG GGTGA 
CAACAGGTGTCATCTTTTTGATCTCGTGTGTGGCTGCCTTCCTATTTCAAGGAAAGACGCCAAGGTAATTTTGACCCA 
GAGGAGCAATGATGTAGCCACCTCCTAACCTTCCCTTCTTGAACCCCCAGTTATGCCAGGATTTACTAGAGAGTGTCA 
ACTCAACCAGCAAGCGGCTCCTTCGGCTTAACTTGTGGTTGGAGGAGAGAACCTTTGTGGGGCTGCGTTCTCTTAGCA 
GT GCT CAGAAGTGACTT GCCTGAGGGT GGACCAGAAGAAAGGAAAGGT CCCCT CTTGCT GTT GGCTGCACATCAG GAA 
GGCTGTGATGGGAATGAAGGTGAAAACTTGGAGATTTCACTTCAGTCATTGCTTCTGCCTGCAAGATCATCCTTTAAA 
AGTAGAGAAGCT G CT CT GTGTGGT GGTTAACT C CAAGAGGCAGAACT C GTTCTAGAAG GAAATGGAT GCAAGCAG CT C 
CGGGGGCCCCAAACGCATGCTTCCTGTGGTCTAGCCCAGGGAAGCCCTTCCGTGGGGGCCCCGGCTTTGAGGGATGCC 
ACCGGTTCTGGACGCATGGCTGATTCCTGA ATGA TGATGGTTCGCCGGGGGCTGCTTGCGTGGATTTCCCGGGTGGTG 
GTTTTGCTGGTGCTCCTCTGCTGTGCTATCTCTGTCCTGTACATGTTGGCCTGCACCCCAAAAGGTGACGAGGAGCAG 
CTGGCACTGCCCAGGGCCAACAGCCCCACGGGGAAGGAGGGGTACCAGGCCGTCCTTCAGGAGTGGGAGGAGCAGCAC 
CGCAACTAC GTGAGCAGC CT GAAGCGGCAGAT C GCACAGCTCAAGGAGGAGCT GCAGGAGAGGAGTGAGCAGCT CAGG 
AAT GG GCAGTAC CAAGCCAGCGAT GCT G CT GG CCT GGGT CTGGACAGGAGCCCC C CAGAGAAAACCCAGGCC GACCTC 
CT GGC CTT C CTGCACTCGCAGGT GGACAAGGCAGAGGTGAAT GCTGGC GT CAAGCTGGCCACAGAGTATGCAG CAGT G 

CCTT T CGATAGCT TTACT CTACAGAAG GT GTACCAGCT GGAGACTGGCCTTACCCGC CAC C CC GAGGAGAAGC CTGT G 
|f| AGGAAGGACAAGCGG GATGAGTT GGT G GAAGC CATTGAAT CAGCCTT GGAGAC CCTGAACAAT CCT GCAGAGAACAGC 

|,| CCCAATCAC CGT CCTTACAC GGC CT CT GATTT CAT AGAAGGGAT CTAC CGAACAGAAAGGGACAAA.GGGACATT GTAT 

if; ? GAGCTCACCTTCAAAGGGGACCACAAACACGAATTCAAACGGCTCATCTTATTTCGACCATTCAGCCCCATCATGAAA 

II 1 GT GAAAAAT GAAAAGCT CAACAT GGCCAACAC GCTTAT CAAT GT TAT CGT GCCT CTAGCAAAAAGGGT GGACAAGTT C 
ffv CGGCAGTTCATGCAGAATTTCAGGGAGATGTGCATTGAGCAGGATGGGAGAGTCCATCTCACTGTTGTTTACTTTGGG 

AAAGAAGAAATAAAT GAAGT CAAAGGAATACT T GAAAACACTT CCAAAGCTGC CAACTTCAGGAACT TT ACCT TCAT C 

III CAGCTGAATGGAGAATTTTCTCGGGGAAAGGGACTTGATGTTGGAGCCCGCTTCTGGAAGGGAAGCAACGTCCTTCTC 
„ TTTTTCTGTGATGTGGACATCTACTTCACATCTGAATTCCTCAATACGTGTAGGCTGAATACACAGCCAGGGAAGAAG 

GTATT TTAT CCAGTT CTTT T CAGT CAGT ACAAT CCTGGCATAATATAC GGCCACCAT GATGCAGT CCCTCCCTT GGAA 
C3 CAGCAGCTGGTCATAAAGAAGGAAACTGGATTTTGGAGAGACTTTGGATTTGGGATGACGTGTCAGTATCGGTCAGAC 
|f| TT CAT CAATATAGGT GGGT TTGAT CT GGACAT CAAAGGCT GGGGCGGAGAGGAT GTGCACCT TTATCGCAAGTATCT C 

CACAGCAACCTCATAGT GGTAC GGACGCCT GT GCGAGGACTCT T CCAC CT CT GGCAT GAGAAGCGCT GCAT GGAC GAG 
\J CT GAC CC CC GAGCAGTACAAGATGT GCAT GCAGT CCAAGGCCAT GAACGAGGCAT CC CACGGC CAGCT GGGCATG CT G 

f%v. GT GT T CAGGCACGAGATAGAGGCTCACCTT CGCAAACAGAAACAGAAGACAAGT AGCAAAAAAACATGAACT CCCAGA 

Ijjf GAAGGATT GT GGGAGACACTTT TT CTTT CCTT TT GCAAT TACT GAAAGTGGCT GCAACAGAGAAAAGACTT CCATAAA 

lj GGACGAOy^AGAATTGGACTGATGGGTCAGAGATGAGAAAGCCTCCGATTTCTCTCTGTTGGGCTTTTTACAACAGA 
f|S AATCAAAATCTCCGCTTTGCCTGCAAAAGTAACCCAGTTGCACCCTGTGT^lGTGTCTGACAAAGGCAGAATGCTTGTG 
AGATTATAAGCCTAATGGTGTGGAGGTTTTGATGGTGTTTACAATACACTGAGACCTGTTGTTTTGTGTGCTCATTGA 
AATATTCATGATTTAAGAGCAGTTTTGTAAAAAATTCATTAG(^TGAAAGGCAAGCATATTTCTCCTCATATGAATGA 
GCCTATCAGCAGGGCTCTAGTTTCTAGGAATGCTAAAATATCAGAAGGCAGGAGAGGAGATAGGCTTATTATGATACT 
AGTGAGTACATTAAGTAAAATAAAATGGACCAGAAAAGAAAAGAAACCATAAATATCGTGTCATATTTTCCCCAAGAT 
TAACCAAAAATAATCTGCTTATCTTTTTGGTTGTCCTTTTAACTGTCTCCGTTTTTTTCTTTTATTTAAAAATGCACT 
TT T TTTCCCTTGT GAGTTATAGT CT GCTTATT TAATTACCACTTT GCAAGCCTTACAAGAGAGC ACAAGT TGG CCTAC 
AT T TTTATATTTTT TAAGAAGATAC TTT GAGAT GCATTAT GAGAACT TTCAGTTCAAAGCAT C AAATT GAT GC CATAT 
CCAAGGACATGCC7\AATGCTGATTCTGTCAGGCACTGAATGTCAGGCATTGAGACATAGGGAAGGAATGGTTTGTACT 
AATACAGACGTACAGATACTTTCTCTGAAGAGTATTTTCGAAGAGGAGCAACTGAACACTGGAGGAAAAGAAAATGAC 
AC T T T CTGCT TTACAGAAAAGGAAACT CAT TCAGACT GGT GATAT CGT GAT GTAC CT AAAAGT CAGAAACCACAT TTT 
CTCCTCAGAAGTAGGGACCGCTTTCTTACCTGTTTAAATAAACCAAAGTATACCGTGTGAACCAAACAATCTCTTTTC 
AAAACAGGGT GCT C CTCCTGGCTT CTGGCT TCCATAAGAAGAAAT GGAGAAAAATATATATATATATATATATAT T GT 
GAAAGAT CAATCCAT CT GCCAGAAT CT AGT GGGAT GGAAGTTT TT GCTACAT GTTAT CCACC CCAGGC CAGGT GGAAG 
TAACTGAATTATTTTTTAAATTAAGCAGTTCTACTCAATCACCAAGATGCTTCTGAAAATTGCATTTTATTACCATTT 
CAAA.CTATTT TTTAAAAATAAATACAGTTAA.CATAGAGT GGTT T CTT CATT CAT GT GAAAAT TATTAGCCAGCACCAG 
ATGCATGAGCTAATTATCTCTTTGAGTCCTTGCTTCTGTTTGCTCACAGTAAACTCATTGTTTAAAAGCTTCAAGAAC 
AT T CAAGCTGTT GGT GT GTTAAAAAAT GCATT GTATT GATTT GTACT G GT AGTTTAT GAAATT T AAT TAAAA.CACAGG 
CCAT GAAT GGAAGGT GGT ATTGCACAGCTAATAAAATAT GAT TT GTGGATAT GAA 
FIGURE 72



MMMVRRGLLAWISRVVVLLVLLCCAISVLYMLACTPKGDEEQLALPRANSPTGKEGYQAVLQEWE
EQHRNYVSSLKRQIAQLKEELQERSEQLRNGQYQASDAAGLGLDRSPPEKTQADLLAFLHSQVDK 
AEVNAGVKLATEYAAVPFDSFTLQKVYQLETGLTRHPEEKPVRKDKRDELVEAIESALETLNNPA
ENSPNHRPYTASDFIEGIYRTERDKGTLYELTFKGDHKHEFKRLILFRPFSPIMKVKNEKLNMAN
TLINVIVPLAKRVDKFRQFMQNFREMCIEQDGRVHLTVVYFGKEEINEVKGILENTSKAANFRNF
TFIQLNGEFSRGKGLDVGARFWKGSNVLLFFCDVDIYFTSEFLNTCRLNTQPGKKVFYPVLFSQY 
NPGIIYGHHDAVPPLEQQLVIKKETGFWRDFGFGMTCQYRSDFINIGGFDLDIKGWGGEDVHLYR 
KYLHSNLIVVRTPVRGLFHLWHEKRCMDELTPEQYKMCMQSKAMNEASHGQLGMLVFRHEIEAHL
RKQKQKTSSKKT 

Important features: 
Signal peptide: 

amino acids 1-27 

N-glycosylation sites. 

amino acids 315-319, 324-328 

N-myristoylation sites. 

amino acids 96-102, 136-142, 212-218, 311-317, 339-345, 393-399 

Amidation site.

amino acids 377-381 
FIGURE 73

GAGAC TGCAGAGGGAGATAAAGAGAGAGGGCAAAGAGGCAGCAAGAGAT T TGT CCT GGGGAT CCA 
GAAACCCATGATACCCTACTGAACACCGAATCCCCTGGAAGCCCACAGAGACAGAGACAGCAAGA 
GAAGCAGAGATAAATACACTCACGCCAGGAGCTCGCTCGCTCTCTCTCTCTCTCTCTCACTCCTC 
CCTCCCTCTCTCTCTGCCTGTCCTAGTCCTCTAGTCCTCAAATTCCCAGTCCCCTGCACCCCTTC 
CTGGGACACTATGTTGTTCTCCGCCCTCCTGCTGGAGGTGATTTGGATCCTGGCTGCAGATGGGG 
GTCAACACTGGACGTATGAGGGCCCACATGGTCAGGACCATTGGCCAGCCTCTTACCCTGAGTGT 
GGAAACAATGCCCAGTCGCCCATCGATAT TCAGACAGACAGT GTGACAT TTGACCCTGATT TGCC 
T GC T CT GCAGC C C C ACGGATAT GAC CAG CCT G GCAC CGAGCC T T T G GAC C T GCACAAC AAT GG C C 
ACACAGTGCAACTCTCTCTGCCCTCTACCCTGTATCTGGGTGGACTTCCCCGAAAATATGTAGCT 
GCCCAGC TCCACC T GCAC T GGGGT CAGAAAGGAT CCCCAGGGGGGTCAGAACACCAGATCAACAG 
TGAAGCCACATTTGCAGAGCTCCACATTGTACATTATGACTCTGATTCCTATGACAGCTTGAGTG 
AG GC T G C T GAGAG GCC T C AGGGCC TGGCTGTCC T GG GCAT C C T AAT T GAGG T GG G T GAGAC T AAG 
AATATAGCT TATGAACACAT T C TGAGTCACT T GCATGAAGTCAGGCATAAAGAT C AGAAGACCT C 
AGTGCCTCCCTTCAACCTAAGAGAGCTGCTCCCCAAACAGCTGGGGCAGTACTTCCGCTACAATG 
GCTCGCTCACAACTCCCCCTTGCTACCAGAGTGTGCTCTGGACAGTTTTTTATAGAAGGTCCCAG 
ATTTCAATGGAACAGCTGGAAAAGCTTCAGGGGACATTGTTCTCCACAGAAGAGGAGCCCTCTAA 
GCTTCTGGTACAGAACTACCGAGCCCTTCAGCCTCTCAATCAGCGCATGGTCTTTGCTTCTTTCA 
TCCAAGCAGGATCCT CGTATACCACAGGTGAAATGC TGAGTC TAGG T G TAGGAATCT T GGT TGGC 
TGTCTCTGCCTTCTCCTGGCTGTTTATTTCATTGCTAGAAAGATTCGGAAGAAGAGGCTGGAAAA 
CCGAAAGAGT GTGGTCTT CACC TCAGCACAAGCCAC GAC T GAGGCATAAAT T CCTT C T CAGATAC 
CAT GGATG TGGATGACT T CCC T TCATGC C TATCAGGAAGCCT C TAAAATGGGGT GTAGGAT C TGG 
CCAGAAACAC T G T AGGAG TAG TAAGCAGATGTCCT CCTT CCCC TGGACATC TC T T AGAGAGGAAT 
GGACCCAGGCTGTCATTCCAGGAAGAACTGCAGAGCCTTCAGCCTCTCCAAACATGTAGGAGGAA 
ATGAGGAAATCGCTGTGTTGTTAATGCAGAGANCAAACTCTGTTTAGTTGCAGGGGAAGTTTGGG 
ATATACCCCAAAGTCCTCTACCCCCTCACTTTTATGGCCCTTTCCCTAGATATACTGCGGGATCT 
CTCCTTAGGATAAAGAGTTGCTGTTGAAGTTGTATATTTTTGATCAATATATTTGGAAATTAAAG 
TTTCTGACTTT 
FIGURE 74



MLFSALLLEVIWILAADGGQHWTYEGPHGQDHWPASYPECGNNAQSPIDIQTDSVTFDPDLPALQ
PHGYDQPGTEPLDLHNNGHTVQLSLPSTLYLGGLPRKYVAAQLHLHWGQKGSPGGSEHQINSEAT 
FAELHIVHYDSDSYDSLSEAAERPQGLAVLGILIEVGETKNIAYEHILSHLHEVRHKDQKTSVPP 
FNLRELLPKQLGQYFRYNGSLTTPPCYQSVLWTVFYRRSQISMEQLEKLQGTLFSTEEEPSKLLV 
QNYRALQPLNQRMVFASFIQAGSSYTTGEMLSLGVGILVGCLCLLLAVYFIARKIRKKRLENRKS 
VVFTSAQATTEA

Important features of the protein:
Signal peptide:
amino acids 1-15

Transmembrane domain:

amino acids 291-310

N-glycosylation site.

amino acids 213-216

Eukaryotic-type carbonic anhydrases proteins

amino acids 197-245, 104-140, 22-69
FIGURE 75

TGCCGCTGCCGCCGCTGCTGCTGTTGCTCCTGGCGGCGCCTTGGGGACGGGCAGTTCCCTGTGTC 
TCTGGTGGTTTGCCTAAACCTGCAAACATCACCTTCTTATCCATCAACATGAAGAATGTCCTACA 
ATGGAC TCCACCAGAGGGTCTTCAAGGAGTTAAAGT TACT TACACTGTGCAGTATT TCATCACAA 
ATTGGCCCACCAGAGGTGGCACTGACTACAGATGAGAAGTCCATTTCTGTTGTCCTGACAGCTCC 
AGAGAAGTGGAAGAGAAATCCAGAAGACCTTCCTGTTTCCATGCAACAAATATACTCCAATCTGA 
AGTATAACGTGTCTGTGTTGAATACTAAATCAAACAGAACGTGGTCCCAGTGTGTGACCAACCAC 
ACGCTGGTGCTCACCTGGCTGGAGCCGAACACTCTTTACTGCGTACACGTGGAGTCCTTCGTCCC 
AGGGCCCCCTCGCCGTGCTCAGCCTTCTGAGAAGCAGTGTGCCAGGACTTTGAAAGATCAATCAT 
CAGAGTTCAAGGCTAAAATCATCTTCTGGTATGTTTTGCCCATATCTATTACCGTGTTTCTTTTT 
T C T G T GAT G GG C TAT T C CAT C T AC C GAT AT AT C C AC G T T G G C AAAGAGAAAC AC C C AGC AAAT T T 
GAT T TT GATT TAT GGAAAT GAATT TGACAAAAGATT CT T T GT GCC TGCT GAAAAAAT CGTGAT TA 
AC T T TATCACCC T CAATAT CT CGGATGAT TC TAAAATT TC TCATCAGGATATGAGT T TAC T GGGA 
AAAAGCAGTGATGTATCCAGCCTTAATGATCCTCAGCCCAGCGGGAACCTGAGGCCCCCTCAGGA 
GGAAGAGGAGGTGAAACATT TAGGGTATGCT T CGCATT TGATGGAAATTT TTTGTGACTCTGAAG 
AAAACACGGAAGGTACTT CT C TCACCCAGCAAGAGT CCC T CAGCAGAACAATACCCCCGGATAAA 
ACAGTCAT TGAATAT GAATAT GATG T CAGAACCAC T GACAT TTGTGCGGGGCC T GAAGAGCAGGA 
GCT CAGT TTGCAGGAGGAGGT GTCCACACAAGGAACAT TAT TGGAGT CGCAGGCAGCGTT GGCAG 
TCTTGGGCCCGCAAACGTTACAGTACTCATACACCCCTCAGCTCCAAGACTTAGACCCCCTGGCG 
CAGGAGCACACAGACTCGGAGGAGGGGCCGGAGGAAGAGCCATCGACGACCCTGGTCGACTGGGA 
TCCCCAAACTGGCAGGCTGTGTATTCCTTCGCTGTCCAGCTTCGACCAGGATTCAGAGGGCTGCG 
AGCCTTCTGAGGGGGATGGGCTCGGAGAGGAGGGTCTTCTATCTAGACTCTATGAGGAGCCGGCT 
CCAGACAGGCCACCAGGAGAAAATGAAACCTAT CT CATGCAAT TCATGGAGGAATGGGGGTTATA 
TGTGCAGATGGAAAACTGATGCCAACACTTCCTTTTGCCTTTTGTTTCCTGTGCAAACAAGTGAG 
TCACCCCTTTGATCCCAGCCATAAAGTACCTGGGATGAAAGAAGTTTTTTCCAGTTTGTCAGTGT 
CTGTGAGAATTACTTATTTCTTTTCTCTATTCTCATAGCACGTGTGTGATTGGTTCATGCATGTA 
GGTCTCTTAACAATGATGGTGGGCCTCTGGAGTCCAGGGGCTGGCCGGTTGTTCTATGCAGAGAA 
AGCAGTCAATAAATGTTTGCCAGACTGGGTGCAGAATTTATTCAGGTGGGTGT 
FIGURE 76



MSYNGLHQRVFKELKLLTLCSISSQIGPPEVALTTDEKSISVVLTAPEKWKRNPEDLPVSMQQIY
SNLKYNVSVLNTKSNRTWSQCVTNHTLVLTWLEPNTLYCVHVESFVPGPPRRAQPSEKQCARTLK 
DQSSEFKAKIIFWYVLPISITVFLFSVMGYSIYRYIHVGKEKHPANLILIYGNEFDKRFFVPAEK 
IVINFITLNISDDSKISHQDMSLLGKSSDVSSLNDPQPSGNLRPPQEEEEVKHLGYASHLMEIFC
DSEENTEGTSLTQQESLSRTIPPDKTVIEYEYDVRTTDICAGPEEQELSLQEEVSTQGTLLESQA 
ALAVLGPQTLQYSYTPQLQDLDPLAQEHTDSEEGPEEEPSTTLVDWDPQTGRLCIPSLSSFDQDS 
EGCEPSEGDGLGEEGLLSRLYEEPAPDRPPGENETYLMQFMEEWGLYVQMEN 

Important features: 
Signal peptide: 

amino acids 1-28 

Transmembrane domain: 

amino acids 140-163

N-glycosylation sites. 

amino acids 71-74, 80-83, 89-92, 204-207, 423-426 
FIGURE 77

GAGGAGCGGGCCGAGGACTCCAGCGTGCCCAGGTCTGGCATCCTGCACTTGCTGCCCTCTGACAC 
CTGGGAAGATGGCCGGCCCGTGGACCTTCACCCTTCTCTGTGGTTTGCTGGCAGCCACCTTGATC 
CAAGCCACCCTCAGTCCCACTGCAGTTCTCATCCTCGGCCCAAAAGTCATCAAAGAAAAGCTGAC 
ACAGGAGCT GAAGGACCACAACGCCACCAGCATC CT GCAGCAGC TGCCGC TGC T CAGTGCCATGC 
GGGAAAAGCCAGCCGGAGGCATCCCTGTGCTGGGCAGCCTGGTGAACACCGTCCTGAAGCACATC 
ATCTGGCTGAAGGTCATCACAGCTAACATCCTCCAGCTGCAGGTGAAGCCCTCGGCCAATGACCA 
GGAGCTGCTAGTCAAGATCCCCCTGGACATGGTGGCTGGATTCAACACGCCCCTGGTCAAGACCA 
TCGTGGAGTTCCACATGACGACTGAGGCCCAAGCCACCATCCGCATGGACACCAGTGCAAGTGGC 
CCCACCCGCCTGGTCCTCAGTGACTGTGCCACCAGCCATGGGAGCCTGCGCATCCAACTGCTGTA 
TAAGCTCTCCTTCCTGGTGAACGCCTTAGCTAAGCAGGTCATGAACCTCCTAGTGCCATCCCTGC 
CCAATCTAGTGAAAAACCAGCTGTGTCCCGTGATCGAGGCTTCCTTCAATGGCATGTATGCAGAC 
CTCCTGCAGCTGGTGAAGGTGCCCATTTCCCTCAGCATTGACCGTCTGGAGTTTGACCTTCTGTA 
TCCTGCCATCAAGGGTGACACCATTCAGCTCTACCTGGGGGCCAAGTTGTTGGACTCACAGGGAA 
AGGTGACCAAGTGGTTCAATAACTCTGCAGCTTCCCTGACAATGCCCACCCTGGACAACATCCCG
TTCAGCCTCATCGTGAGTCAGGACGTGGTGAAAGCTGCAGTGGCTGCTGTGCTCTCTCCAGAAGA 
ATTCATGGTCCTGTTGGACTCTGTGCTTCCTGAGAGTGCCCATCGGCTGAAGTCAAGCATCGGGC 
TGATCAATGAAAAGGCTGCAGATAAGCTGGGATCTACCCAGATCGTGAAGATCCTAACTCAGGAC 
ACTCCCGAGTTTTTTATAGACCAAGGCCATGCCAAGGTGGCCCAACTGATCGTGCTGGAAGTGTT 
TCCCTCCAGTGAAGCCCTCCGCCCTTTGTTCACCCTGGGCATCGAAGCCAGCTCGGAAGCTCAGT 
TTTACACCAAAGGT GACCAACT TATACT CAACT TGAATAACAT CAGC TC T GATCGGAT CCAGC TG 
ATGAAC TC TGGGAT T GGCT GGT TCCAAC CTGAT GT T CT GAAAAACAT CATCAC T GAGATCATCCA 
CTCCATCCTGCTGCCGAACCAGAATGGCAAATTAAGATCTGGGGTCCCAGTGTCATTGGTGAAGG 
CCTTGGGATTCGAGGCAGCTGAGTCCTCACTGACCAAGGATGCCCTTGTGCTTACTCCAGCCTCC 
T TG TGGAAACC CAGC TCT CCT GTCT CCCAGTGAAGACT T GGAT GGCAGCCATCAGGGAAGGC T GG 
GTCCCAGCTGGGAGTATGGGTGTGAGCTCTATAGACCATCCCTCTCTGCAATCAATAAACACTTG 
CCTGTGAAAAA 
FIGURE 78

MAGPWTFTLLCGLLAATLIQATLSPTAVLILGPKVIKEKLTQELKDHNATSILQQLPLLSAMREK 
PAGGIPVLGSLVNTVLKHIIWLKVITANILQLQVKPSANDQELLVKIPLDMVAGFNTPLVKTIVE
FHMTTEAQATIRMDTSASGPTRLVLSDCATSHGSLRIQLLYKLSFLVNALAKQVMNLLVPSLPNL
VKNQLCPVIEASFNGMYADLLQLVKVPISLSIDRLEFDLLYPAIKGDTIQLYLGAKLLDSQGKVT
KWFNNSAASLTMPTLDNIPFSLIVSQDVVKAAVAAVLSPEEFMVLLDSVLPESAHRLKSSIGLIN
EKAADKLGSTQIVKILTQDTPEFFIDQGHAKVAQLIVLEVFPSSEALRPLFTLGIEASSEAQFYT
KGDQLILNLNNISSDRIQLMNSGIGWFQPDVLKNIITEIIHSILLPNQNGKLRSGVPVSLVKALG
FEAAESSLTKDALVLTPASLWKPSSPVSQ

Important features of the protein:
Signal peptide:

amino acids 1-21

N-glycosylation sites.

amino acids 48-51, 264-267, 401-404 

Glycosaminoglycan attachment site. 

amino acids 412-415 

LBP / BPI / CETP family proteins. 

amino acids 407-457 
FIGURE 79

GAGAGAAGT CAGCCTGGCAGAGAGAC T C TGAAAT GAGGGATT AGAGGTGT TCAAGGAGCAAGAGC 
TTCAGCCTGAAGACAAGGGAGCAGTCCCTGAAGACGCTTCTACTGAGAGGTCTGCCATGGCCTCT 
CTTGGCCTCCAACTTGTGGGCTACATCCTAGGCCTTCTGGGGCTTTTGGGCACACTGGTTGCCAT 
GCTGCTCCCCAGCTGGAAAACAAGTTCTTATGTCGGTGCCAGCATTGTGACAGCAGTTGGCTTCT 
CCAAGGGCCTCTGGATGGAATGTGCCACACACAGCACAGGCATCACCCAGTGTGACATCTATAGC 
ACCCTTCTGGGCCTGCCCGCTGACATCCAGGCTGCCCAGGCCATGATGGTGACATCCAGTGCAAT 
CTCCTCCCTGGCCTGCATTATCTCTGTGGTGGGCATGAGATGCACAGTCTTCTGCCAGGAATCCC
GAGCCAAAGACAGAGTGGCGGTAGCAGGTGGAGTCTTTTTCATCCTTGGAGGCCTCCTGGGATTC
ATTCCTGTTGCCTGGAATCTTCATGGGATCCTACGGGACTTCTACTCACCACTGGTGCCTGACAG
CATGAAATTTGAGATTGGAGAGGCTCTTTACTTGGGCATTATTTCTTCCCTGTTCTCCCTGATAG
CTGGAATCATCCTCTGCTTTTCCTGCTCATCCCAGAGAAATCGCTCCAACTACTACGATGCCTAC
CAAGCCCAACCTCTTGCCACAAGGAGCTCTCCAAGGCCTGGTCAACCTCCCAAAGTCAAGAGTGA
GTTCAATTCCTACAGCCTGACAGGGTATGTG TGAA GAACCAGGGGCCAGAGCTGGGGGGTGGCTG
GGT CTGTGAAAAACAGTGGACAGCACCCCGAGGGCCACAGGT GAGGGACAC TACCACT GGATCGT
G T C AGAAG GTGCTGCT GAG GAT AG AC T G AC T T T G G C CAT T G GAT T GAG C AAAG G C AG AAAT G G G G
GCTAGTGTAACAGCATGCAGGTTGAATTGCCAAGGATGCTCGCCATGCCAGCCTTTCTGTTTTCC
TCACCTTGCTGCTCCCCTGCCCT7y^TCCCCAACCCTCAACTTGAAACCCCATTCCCTT7\AGCCA
GGACTCAGAGGATCCCTTTGCCCTCTGGTTTACCTGGGACTCCATCCCCAAACCCACTAATCACA 
TCCCACTGACTGACCCTCTGTGATCAAAGACCCTCTCTCTGGCTGAGGTTGGCTCTTAGCTCATT 
GCTGGGGATGGGAAGGAGAAGCAGTGGCTTTTGTGGGCATTGCTCTAACCTACTTCTCAAGCTTC 
CCTCCAAAGAAACTGATTGGCCCTGGAACCTCCATCCCACTCTTGTTATGACTCCACAGTGTCCA 
GAC TAAT TTGT GCATGAACT GAAATAAAACCATCC TACGGTATCCAGGGAACAGAAAGCAGGATG 
C AGGAT GGGAG GAC AGG AAG GCAGC C T G GGACAT T T AAAAAAAT A 
FIGURE 80

MASLGLQLVGYILGLLGLLGTLVAMLLPSWKTSSYVGASIVTAVGFSKGLWMECATHSTGITQCD
IYSTLLGLPADIQAAQAMMVTSSAISSLACIISVVGMRCTVFCQESRAKDRVAVAGGVFFILGGL
LGFIPVAWNLHGILRDFYSPLVPDSMKFEIGEALYLGIISSLFSLIAGIILCFSCSSQRNRSNYY 
DAYQAQPLATRSSPRPGQPPKVKSEFNSYSLTGYV 

Important features of the protein: 
Signal peptide: 

amino acids 1-24 

Transmembrane domains:

amino acids 82-102, 117-140, 163-182 

N-glycosylation site. 

amino acids 190-193 

PMP-22 / EMP / MP20 family proteins. 

amino acids 46-59 






FIGURE 81 

CCCACGCGTCCGCGCCTCTCCCTTCTGCTGGACCTTCCTTCGTCTCTCCATCTCTCCCTCCTTTC 
CCCGCGTTCTCTTTCCACCTTTCTCTTCTTCCCACCTTAGACCTCCCTTCCTGCCCTCCTTTCCT 
GCCCACCGCTGCTTCCTGGCCCTTCTCCGACCCCGCTCTAGCAGCAGACCTCCTGGGGTCTGTGG 
GTTGATCTGTGGCCCCTGTGCCTCCGTGTCCTTTTCGTCTCCCTTCCTCCCGACTCCGCTCCCGG 
ACCAGCGGCCTGACCCTGGGGAAAGG ATG GTTCCCGAGGTGAGGGTCCTCTCCTCCTTGCTGGGA 
CTCGCGCTGCTCTGGTTCCCCCTGGACTCCCACGCTCGAGCCCGCCCAGACATGTTCTGCCTTTT 
CCATGGGAAGAGATACTCCCCCGGCGAGAGCTGGCACCCCTACTTGGAGCCACAAGGCCTGATGT 
ACTGCCTGCGCTGTACCTGCTCAGAGGGCGCCCATGTGAGTTGTTACCGCCTCCACTGTCCGCCT 
GTCCACTGCCCCCAGCCTGTGACGGAGCCACAGCAATGCTGTCCCAAGTGTGTGGAACCTCACAC 
TCCCTCTGGACTCCGGGCCCCACCAAAGTCCTGCCAGCACAACGGGACCATGTACCAACACGGAG 
AGATCTTCAGTGCCCATGAGCTGTTCCCCTCCCGCCTGCCCAACCAGTGTGTCCTCTGCAGCTGC 
ACAGAGGGCCAGATCTACTGCGGCCTCACAACCTGCCCCGAACCAGGCTGCCCAGCACCCCTCCC 
AC T GCCAGAC T CCT GCT GCCAAGCC TGCAAAGATGAGGCAAGT GAGCAATC GGAT GAAGAGGACA 
GTGTGCAGTCGCTCCATGGGGTGAGACATCCTCAGGATCCATGTTCCAGTGATGCTGGGAGAAAG 
AGAGGCCCGGGCACCCCAGCCCCCACTGGCCTCAGCGCCCCTCTGAGCTTCATCCCTCGCCACTT 
CAGACCCAAGGGAGCAGGCAGCACAAC TGT CAAGATCGT CCT GAAGGAGAAACATAAGAAAGCC T 
GTGTGCATGGCGGGAAGACGTACTCCCACGGGGAGGTGTGGCACCCGGCCTTCCGTGCCTTCGGC 
CCCTTGCCCTGCATCCTATGCACCTGTGAGGATGGCCGCCAGGACTGCCAGCGTGTGACCTGTCC 
CACCGAGTACCCCTGCCGTCACCCCGAGAAAGTGGCTGGGAAGTGCTGCAAGATTTGCCCAGAGG 
ACAAAGCAGACCCTGGCCACAGTGAGATCAGTTCTACCAGGTGTCCCAAGGCACCGGGCCGGGTC 
CTCGTCCACACATCGGTATCCCCAAGCCCAGACAACCTGCGTCGCTTTGCCCTGGAACACGAGGC 
CTCGGAC T TGGT GGAGAT CTACC TC TGGAAGC TGGT AAAAGATGAGGAAAC TGAGGCT CAGAGAG 
G TGAAGTACC T GGCCCAAGGCCACACAGCCAGAAT CT TCCACT TGAC TCAGATCAAGAAAG T CAG 
GAAGCAAGACTTCCAGAAAGAGGCACAGCACTTCCGACTGCTCGCTGGCCCCCACGAAGGTCACT 
GGAACGTCTTCCTAGCCCAGACCCTGGAGCTGAAGGTCACGGCCAGTCCAGACAAAGTGACCAAG 
ACATAACAAAGACCTAACAGT TGCAGATATGAGCTGTAT AATTGT TGTTAT TATATATTAAT AAA 
TAAGAAGT TGCATTACC C T CAA?U\AAAAAAAAAAAAAAAAAA 
FIGURE 82



MVPEVRVLSSLLGLALLWFPLDSHARARPDMFCLFHGKRYSPGESWHPYLEPQGLMYCLRCTCSE
GAHVSCYRLHCPPVHCPQPVTEPQQCCPKCVEPHTPSGLRAPPKSCQHNGTMYQHGEIFSAHELF 
PSRLPNQCVLCSCTEGQIYCGLTTCPEPGCPAPLPLPDSCCQACKDEASEQSDEEDSVQSLHGVR 
HPQDPCSSDAGRKRGPGTPAPTGLSAPLSFIPRHFRPKGAGSTTVKIVLKEKHKKACVHGGKTYS 
HGEVWHPAFRAFGPLPCILCTCEDGRQDCQRVTCPTEYPCRHPEKVAGKCCKICPEDKADPGHSE 
ISSTRCPKAPGRVLVHTSVSPSPDNLRRFALEHEASDLVEIYLWKLVKDEETEAQRGEVPGPRPH
SQNLPLDSDQESQEARLPERGTALPTARWPPRRSLERLPSPDPGAEGHGQSRQSDQDITKT 

Signal peptide: 

amino acids 1-25 
FIGURE 83



GACAGCTGTGTCTCGATGGAGTAGACTCTCAGAACAGCGCAGTTTGCCCTCCGCTCACGCAGAGCCTCTCC 
GTGGCTTCCGCACCTTGAGCATTAGGCCAGTTCTCCTCTTCTCTCTAATCCATCCGTCACCTCTCCTGTCA 
TCCGTTTCCATGCCGTGAGGTCCATTCACAGAACACATCCATGGCTCTCATGCTCAGTTTGGTTCTGAGTC 
TCCTC AAG CT GGGATCAGGGC AGT GGCAGGT GTT TGGGCC AGACAAGCCT GT CCAGGCCTT GGTGGGGGAG 
GACGCAGCATTCTCCTGTTTCCTGTCTCCTAAGACCAATGCAGAGGCCATGGAAGTGCGGTTCTTCAGGGG 
CCAGTTCTCTAGCGTGGTCCACCTCTACAGGGACGGGAAGGACCAGCCATTTATGCAGATGCCACAGTATC 
AAGGCAGG ACAAAACT GGTG AAGG AT TCT ATT GCGG AGGGGCGC AT CTCT CT GAGGCT GGAAAACATT ACT 
GT GTTGGAT GCT GGCCTCT ATGGGTGC AGGAT TAGTTCCC AGTC TTACT ACC AGAAGGCCAT CT GGGAGCT 
ACAGGT GT CAGC ACT G GGCT CAGT TC CTCT CATT T CCATC ACGGGAT ATGTT GAT AGAGACATCC AGCT AC 
TCTGTCAGTCCTCGGGCTGGTTCCCCCGGCCCACAGCGAAGTGGAAAGGTCCACAAGGACAGGATTTGTCC 
ACAGACTCCAGGACAAACAGAGACATGCATGGCCTGTTTGATGTGGAGATCTCTCTGACCGTCCAAGAGAA 
CGCCGGGAGC AT ATCC TGTT CC AT GCGGCAT GCT CAT C TGAGCCGAGAGGTGGAAT CC AGGGTACAGAT AG
GAGATACCTT TT TC GAGCCT AT ATCGT GGCAC CT GGCT ACCAAAGT ACTGGGAAT ACT CTGCTGTGGCCT A
TT TTTT GGCATT GT TGGACT GAAG AT T T TCTT CT CC AAAT TCCAGT GGAAAATCCAGGCGGAACT GGACT G
GAGAAGAAAGC ACGGACAGGCAGAAT T GAG AGACGCCC GGAAAC AC GC AGTGGAGGTGACT CTGGATCC AG
AGACGGCT CACCCGAAGC TCTGCGTT T CTGAT CT GAAAACTGT AAC C C AT AGAAAAGCTCCC CAGGAGGT G
CCTC ACTC TGAG AAGAGATT T ACAAGGAAG AG TGTGGT GGCT TCTC AG AGTT TCCAAGCAGGGAAACATT A
CT GGGAGGTGGACGGAGGAC ACAATAAAAG GT GGCGCGT G GGAGTGT GCCGGGAT GAT GTGGACAGGAGGA
AGGAGTACGTGACTTTGTCTCCCGATCATGGGTACTGGGTCCTCAGACTGAATGGAGAACATTTGTATTTC
ACATTAAATCCCCGTTTTATCAGCGTCTTCCCCAGGACCCCACCTACAAAAATAGGGGTCTTCCTGGACTA
TGAGTGTG GGAC CATCTC CT TC TT C AAC AT AAAT GACC AGT CCCTT ATT T AT ACCCTGAC AT GT CGGT TT G 
AAGGCT T ATT GAGGCCCT AC AT TGAGT ATCCGTCCT AT AATGAGCAAAAT GGAACT CCCAT AGT CAT CTGC 
CCAGTC AC CC AGGAAT CAGAGAAAGAGGCCT C TT GGC AAAGGGCCT CTGCAATCCC AGAG AC AAGC AACAG 
TGAGTCCTCCTCACAGGCAACCACGCCCTTCCTCCCCAGGGGTGAAATGTAGGATGAATCACATCCCACAT 
TCTTCTTTAGGGATATTAAGGTCTCTCTCCCAGATCCAAAGTCCCGCAGCAGCCGGCCAAGGTGGCTTCCA 
GATGAAGGGGGACTGGCCTGTCCACATGGGAGTCAGGTGTCATGGCTGCCCTGAGCTGGGAGGGAAGAAGG 
CT GACATT ACAT TT AGTT T GCT CT CACT CC AT CT GGCT AAGTGATCT TGAAAT ACC ACCT CT CAGGTGAAG 
AACCGTCAGGAATTCCCATCTCACAGGCTGTGGTGTAGATTAAGTAGACAAGGAATGTGAATAATGCTTAG 
AT CTT ATT GAT G AC AGAGT GT ATC CT AATGGT T T GT T C AT T AT ATT AC ACTT TCAGTAAAAAAA 
FIGURE 84

MALMLSLVLSLLKLGSGQWQVFGPDKPVQALVGEDAAFSCFLSPKTNAEAMEVRFFRGQFSSVVH
LYRDGKDQPFMQMPQYQGRTKLVKDSIAEGRISLRLENITVLDAGLYGCRISSQSYYQKAIWELQ 
VSALGSVPLISITGYVDRDIQLLCQSSGWFPRPTAKWKGPQGQDLSTDSRTNRDMHGLFDVEISL
TVQENAGSISCSMRHAHLSREVESRVQIGDTFFEPISWHLATKVLGILCCGLFFGIVGLKIFFSK 
FQWKIQAELDWRRKHGQAELRDARKHAVEVTLDPETAHPKLCVSDLKTVTHRKAPQEVPHSEKRF 
TRKSVVASQSFQAGKHYWEVDGGHNKRWRVGVCRDDVDRRKEYVTLSPDHGYWVLRLNGEHLYFT
LNPRFISVFPRTPPTKIGVFLDYECGTISFFNINDQSLIYTLTCRFEGLLRPYIEYPSYNEQNGT 
PIVICPVTQESEKEASWQRASAIPETSNSESSSQATTPFLPRGEM 

Signal peptide: 

amino acids 1-17 

Transmembrane domain: 

amino acids 239-255 
FIGURE 85



AACAGACGTTCCCTCGCGGCCCTGGCACCTCTAA.CCCCAGACATGCTGCTGCTGCTGCTGCCCCT 
GC TC TGGGGGAGGGAGAGGGCGGAAGGACAGACAAG TAAACT GC T GACGATGCAGAG T TCCGTGA 
CGGTGCAGGAAGGCCTGTGTGTCCATGTGCCCTGCTCCTTCTCCTACCCCTCGCATGGCTGGATT 
TACCCTGGCCCAGTAGTTCATGGCTACTGGTTCCGGGAAGGGGCCAATACAGACCAGGATGCTCC 
AGTGGCCACAAACAACCCAGCTCGGGCAGTGTGGGAGGAGACTCGGGACCGATTCCACCTCCTTG 
GGGACCCACATACCAAGAAT T GCACCCT GAGCAT CAGAGATGCCAGAAGAAGTGAT GCGGGGAGA 
TAC TTC T T TCGTAT GGAGAAAGGAAGTATAAAAT GGAAT T AT AAACATCACCGGCT C T CT GTGAA
TGTGACAGCCTTGACCCACAGGCCCAACATCCTCATCCCAGGCACCCTGGAGTCCGGCTGCCCCC
AGAATCTGACCTGCTCTGTGCCCTGGGCCTGTGAGCAGGGGACACCCCCTATGATCTCCTGGATA
GGGACCTCCGTGTCCCCCCTGGACCCCTCCACCACCCGCTCCTCGGTGCTCACCCTCATCCCACA
GCCCCAGGACCATGGCACCAGCCTCACCTGTCAGGTGACCTTCCCTGGGGCCAGCGTGACCACGA
ACAAGACCGTCCATCTCAACGTGTCCTACCCGCCTCAGAACTTGACCATGACTGTCTTCCAAGGA
GACGGCACAGTATCCACAGTCTTGGGAAATGGCTCATCTCTGTCACTCCCAGAGGGCCAGTCTCT
GCGCCTGGTCTGTGCAGTTGATGCAGTTGACAGCAATCCCCCTGCCAGGCTGAGCCTGAGCTGGA
GAGGCCTGACCCTGTGCCCCTCACAGCCCTCAAACCCGGGGGTGCTGGAGCTGCCTTGGGTGCAC
CTGAGGGATGCAGCTGAATTCACCTGCAGAGCTCAGAACCCTCTCGGCTCTCAGCAGGTCTACCT
GAACGTCTCCCTGCAGAGCAAAGCCACATCAGGAGTGACTCAGGGGGTGGTCGGGGGAGCTGGAG

CCACAGCCCTGGTCTTCCTGTCCTTCTGCGTCATCTTCGTTGTAGTGAGGTCCTGCAGGAAGAAA 
TCGGCAAGGCCAGCAGCGGGCGTGGGAGATACGGGCATAGAGGATGCAAACGCTGTCAGGGGTTC 
AGCCTCTCAGGGGCCCCTGACTGAACCTTGGGCAGAAGACAGTCCCCCAGACCAGCCTCCCCCAG 
CTTCTGCCCGCTCCTCAGTGGGGGAAGGAGAGCTCCAGTATGCATCCCTCAGCTTCCAGATGGTG 
AAGC CT TGGGACT CGCGGGGACAGGAGGCCAC TGACACCGAGTAC TCGGAGAT CAAGATCCACAG 
ATGAGAAACTGCAGAGACTCACCCTGATTGAGGGATCACAGCCCCTCCAGGCAAGGGAGAAGTCA 
GAGGCT GATTCT T GTAGAAT T AACAGCCC TCAACGT GAT GAGC TATGATAACAC TAT GAAT TATG 
TGCAGAGTGAAAAGCACACAGGCTTTAGAGTCAAAGTATCTCAAACCTGAATCCACACTGTGCCC 
TCCCTTTTATTTTTTTAACTAAAAGACAGACAAATTCCTA 
FIGURE 86



MLLLLLPLLWGRERAEGQTSKLLTMQSSVTVQEGLCVHVPCSFSYPSHGWIYPGPVVHGYWFREG
ANTDQDAPVATNNPARAVWEETRDRFHLLGDPHTKNCTLSIRDARRSDAGRYFFRMEKGSIKWNY
KHHRLSVNVTALTHRPNILIPGTLESGCPQNLTCSVPWACEQGTPPMISWIGTSVSPLDPSTTRS 
SVLTLIPQPQDHGTSLTCQVTFPGASVTTNKTVHLNVSYPPQNLTMTVFQGDGTVSTVLGNGSSL 
SLPEGQSLRLVCAVDAVDSNPPARLSLSWRGLTLCPSQPSNPGVLELPWVHLRDAAEFTCRAQNP 
LGSQQVYLNVSLQSKATSGVTQGVVGGAGATALVFLSFCVIFVVVRSCRKKSARPAAGVGDTGIE
DANAVRGSASQGPLTEPWAEDSPPDQPPPASARSSVGEGELQYASLSFQMVKPWDSRGQEATDTE 
YSEIKIHR 

Signal peptide: 

amino acids 1-15 

Transmembrane domain: 

amino acids 351-370 
FIGURE 87



AGAAAGC TGCACT CTGT T GAGC T CCAGGGCGCAGT GGAGGGAGGGAGTGAAGGAGC T C TC TGTAC 
CCAAGGAAAGT GCAGCT GAGAC T CAGACAAGAT TAC AATGA ACCAAC TCAGC TTCCTGCTGTTTC 
T CAT AG C GAC C AC CAGAG GAT GGAG TACAGAT GAG G C T AATAC T T AC T T C AAG GAAT GGAC CTGT 
TCTTCGTCTCCATCTCTGCCCAGAAGCTGCAAGGAAATCAAAGACGAATGTCCTAGTGCATTTGA 
TGGCCTGTATTTTCTCCGCACTGAGAATGGTGTTATCTACCAGACCTTCTGTGACATGACCTCTG 
GGGGTGGCGGCTGGACCCTGGTGGCCAGCGTGCATGAGAATGACATGCGTGGGAAGTGCACGGTG 
GGC GATC GC TGGT CCAGT CAGCAGGGCAGCAAAGCAGAC TACCCAGAGGGGGACGGCAAC TGGGC 
CAAC TACAACACC T T TGGATCTGCAGAGGCGGCCACGAGCGATGAC TACAAGAACC C T GGCT ACT 
ACGACATCCAGGCCAAGGACCTGGGCATCTGGCACGTGCCCAATAAGTCCCCCATGCAGCACTGG 
AGAAACAGCTCCCTGCTGAGGTACCGCACGGACACTGGCTTCCTCCAGACACTGGGACATAATCT 
GTTTGGCATCTACCAGAAATATCCAGTGAAATATGGAGAAGGAAAGTGTTGGACTGACAACGGCC 
CGGTGATCCCTGTGGTCTATGATTTTGGCGACGCCCAGAAAACAGCATCTTATTACTCACCCTAT 
GGC CAGC GGGAAT TCACT GCGGGAT TTGT TCAGT T CAGGGTAT TTAATAAC GAGAGAGCAGCCAA 
CGCCTTGTGTGCTGGAATGAGGGTCACCGGATGTAACACTGAGCATCACTGCATTGGTGGAGGAG 
GATACTTTCCAGAGGCCAGTCCCCAGCAGTGTGGAGATTTTTCTGGTTTTGATTGGAGTGGATAT 
GGAACTCATGTTGGTTACAGCAGCAGCCGTGAGATAACTGAGGCAGCTGTGCTTCTATTCTATCG 
TTGAGAGTTTTGTGGGAGGGAACCCAGACCTCTCCTCCCAACCATGAGATCCCAAGGATGGAGAA 
CAAC T TACCCAGTAGCTAGAATGT TAAT GGCAGAAGAGAAAACAATAAATCATATT GACT CAAGA 
AAAAAA 
FIGURE 88



MNQLSFLLFLIATTRGWSTDEANTYFKEWTCSSSPSLPRSCKEIKDECPSAFDGLYFLRTENGVI 
YQTFCDMTSGGGGWTLVASVHENDMRGKCTVGDRWSSQQGSKADYPEGDGNWANYNTFGSAEAAT 
SDDYKNPGYYDIQAKDLGIWHVPNKSPMQHWRNSSLLRYRTDTGFLQTLGHNLFGIYQKYPVKYG 
EGKCWTDNGPVIPVVYDFGDAQKTASYYSPYGQREFTAGFVQFRVFNNERAANALCAGMRVTGCN
TEHHCIGGGGYFPEASPQQCGDFSGFDWSGYGTHVGYSSSREITEAAVLLFYR 

Important features:
Signal peptide:

amino acids 1-16

N-glycosylation site.

amino acids 163-167

Glycosaminoglycan attachment sites.

amino acids 74-78, 289-293

N-myristoylation sites.

amino acids 76-82, 115-121, 124-130, 253-259, 292-298






FIGURE 89

CTAGATTTGTCGGCTTGCGGGGAGACTTCAGGAGTCGCTGTCTCTGAACTTCCAGCCTCAGAGAC 
CGCCGCCCTTGTCCCCGAGGGCCATGGGCCGGGTCTCAGGGCTTGTGCCCTCTCGCTTCCTGACG 
CTCCTGGCGCATCTGGTGGTCGTCATCACCTTATTCTGGTCCCGGGACAGCAACATACAGGCCTG 
CCTGCCTCTCACGTTCACCCCCGAGGAGTATGACAAGCAGGACATTCAGCTGGTGGCCGCGCTCT 
CTGTCACCCTGGGCCTCTTTGCAGTGGAGCTGGCCGGTTTCCTCTCAGGAGTCTCCATGTTCAAC 
AGCACCCAGAGCCTCATCTCCATTGGGGCTCACTGTAGTGCATCCGTGGCCCTGTCCTTCTTCAT 
ATTCGAGCGTTGGGAGTGCACTACGTATTGGTACATTTTTGTCTTCTGCAGTGCCCTTCCAGCTG 
TCACTGAAATGGCTTTATTCGTCACCGTCTTTGGGCTGAAAAAGAAACCCTTCTGATTACCTTCA 
TGACGGGAACCTAAGGAGGAAGCCTACAGGGGCAAGGGCCGCTTCGTATTCCTGGAAGAAGGAAG 
GCATAGGCTTCGGTTTTCCCCTCGGAAACTGCTTCTGCTGGAGGATATGTGTTGGAATAATTACG 
TCTTGAGTCTGGGATTATCCGCATTGTATTTAGTGCTTTGTAATAAAATATGTTTTGTAGTAACA 
TTAAGACTTATATACAGTTTTAGGGGACAATTAAAAAAAAAAAA 

FIGURE 90

MGRVSGLVPSRFLTLLAHLVVVITLFWSRDSNIQACLPLTFTPEEYDKQDIQLVAALSVTLGLFA
VELAGFLSGVSMFNSTQSLISIGAHCSASVALSFFIFERWECTTYWYIFVFCSALPAVTEMALFV 
TVFGLKKKPF 

Transmembrane domain: 

amino acids 12-28 (type II), 51-66, 107-124 
FIGURE 91

CTGGGACCCCGAAAAGAGAAGGGGAGAGCGAGGGGACGAGAGCGGAGGAGGAAGATCCAACTGAC 
TCGCTGCTGCTTCGTGTTCCTGGTGCAGGGTAGCCTCTATCTGGTCATCTGTGGCCAGGATGATG 
GTCCTCCCGGCTCAGAGGACCCTGAGCGTGATGACCACGAGGGCCAGCCCCGGCCCCGGGTGCCT 
CGGAAGCGGGGCCACATCTCACCTAAGTCCCGCCCCATGGCCAATTCCACTCTCCTAGGGCTGCT 
GGCCCCGCCTGGGGAGGCTTGGGGCATTCTTGGGCAGCCCCCCAACCGCCCGAACCACAGCCCCC 
CACCCTCAGCCAAGGTGAAGAAAATCTTTGGCTGGGGCGACTTCTACTCCAACATCAAGACGGTG 
GCCCTGAACCTGCTCGTCACAGGGAAGATTGTGGACCATGGCAATGGGACCTTCAGCGTCCACTT
CCAACACAAT GCCACAGGCCAGGGAAACATCT CCAT CAGCCT C GTGC CCC CCAGTAAAGC T GTAG
AGTTCCACCAGGAACAGCAGATCTTCATCGAAGCCAAGGCCTCCAAAATCTTCAACTGCCGGATG
GAGTGGGAGAAGGTAGAACGGGGCCGCCGGACCTCGCTTTGCACCCACGACCCAGCCAAGATCTG
CTCCCGAGACCACGCTCAGAGCTCAGCCACCTGGAGCTGCTCCCAGCCCTTCAAAGTCGTCTGTG
TCTACATCGCCTTCTACAGCACGGACTATCGGCTGGTCCAGAAGGTGTGCCCAGATTACAACTAC
CATAGTGATACCCCCTACTACCCATCTGGGTGACCCGGGGCAGGCCACAGAGGCCAGGCCAGGGC
TGGAAGGACAGGCCTGCCCATGCAGGAGACCATCTGGACACCGGGCAGGGAAGGGGTTGGGCCTC
AGGCAGGGAGGGGGGTGGAGACGAGGAGATGCCAAGTGGGGCCAGGGCCAAGTCTCAAGTGGCAG
AGAAAGGGTCC CAAG T GCT GG T CCCAACC TGAAGCT G TGGAGTGAC TAGATCACAGGAGCACT GG
AGGAGGAGTGGGCTCTCTGTGCAGCCTCACAGGGCTTTGCCACGGAGCCACAGAGAGATGCTGGG
TCCCCGAGGCCTGTGGGCAGGCCGATCAGTGTGGCCCCAGATCAAGTCATGGGAGGAAGCTAAGC

CCTTGGTTCTT GCCATC C T GAGGAAAGATAGCAACAGGGAGGGGGAGAT T T CAT CAGT G T GGACA 

GCCTGTCAACTTAGGATGGATGGCTGAGAGGGCTTCCTAGGAGCCAGTCAGCAGGGTGGGGTGGG 

GCCAGAGGAGCTCTCCAGCCCTGCCTAGTGGGCGCCCTGAGCCCCTTGTCGTGTGCTGAGCATGG 

CATGAGGCTGAAGTGGCAACCCTGGGGTCTTTGATGTCTTGACAGATTGACCATCTGTCTCCAGC 

CAGGCCACCCCTTTCCAAAATTCCCTCTTCTGCCAGTACTCCCCCTGTACCACCCATTGCTGATG 

GCACACCCATCCTTAAGCTAAGACAGGACGATTGTGGTCCTCCCACACTAAGGCCACAGCCCATC 

CGCGTGCTGTGTGTCCCTCTTCCACCCCAACCCCTGCTGGCTCCTCTGGGAGCATCCATGTCCCG 

GAGAGGGGTCCCTCAACAGTCAGCCTCACCTGTCAGACCGGGGTTCTCCCGGATCTGGATGGCGC 

CGCCCTCTCAGCAGCGGGCACGGGTGGGGCGGGGCCGGGCCGCAGAGCATGTGCTGGATCTGTTC 

TGTGTGTCTGTCTGTGGGTGGGGGGAGGGGAGGGAAGTCTTGTGAAACCGCTGATTGCTGACTTT 

TGTGTGAAGAATCGTGTTCTTGGAGCAGGAAATAAAGCTTGCCCCGGGGCA 
FIGURE 92

MQLTRCCFVFLVQGSLYLVICGQDDGPPGSEDPERDDHEGQPRPRVPRKRGHISPKSRPMANSTL
LGLLAPPGEAWGILGQPPNRPNHSPPPSAKVKKIFGWGDFYSNIKTVALNLLVTGKIVDHGNGTF 
SVHFQHNATGQGNISISLVPPSKAVEFHQEQQIFIEAKASKIFNCRMEWEKVERGRRTSLCTHDP 
AKICSRDHAQSSATWSCSQPFKVVCVYIAFYSTDYRLVQKVCPDYNYHSDTPYYPSG



Important features of the protein:
Signal peptide:

amino acids 1-14

N-glycosylation sites.

amino acids 62-65, 127-130, 137-140, 143-146

2-oxo acid dehydrogenases acyl transferase

amino acids 61-71
FIGURE 93

CGGTGGCCATGACTGCGGCCGTGTTCTTCGGCTGCGCCTTCATTGCCTTCGGGCCTGCGCTCGCC
CTTTATGTCTTCACCATCGCCATCGAGCCGTTGCGTATCATCTTCCTCATCGCCGGAGCTTTCTT 
CTGGTTGGTGTCTCTACTGATTTCGTCCCTTGTTTGGTTCATGGCAAGAGTCATTATTGACAACA 
AAGATGGACCAACACAGAAATATCTGCTGATCTTTGGAGCGTTTGTCTCTGTCTATATCCAAGAA 
ATGT TCCGAT TTGCATATTATAAACTCTTAAAAAAAGCCAGTGAAGGTT TGAAGAGTATAAACCC 
AGGTGAGACAGCACCCTCTATGCGACTGCTGGCCTATGTTTCTGGCTTGGGCTTTGGAATCATGA 
GTGGAGTATTTTCCTTTGTGAATACCCTATCTGACTCCTTGGGGCCAGGCACAGTGGGCATTCAT
GGAGATTCTCCTCAATTCTTCCTTTATTCAGCTTTCATGACGCTGGTCATTATCTTGCTGCATGT
ATTCTGGGGCATTGTATTTTTTGATGGCTGTGAGAAGAAAAAGTGGGGCATCCTCCTTATCGTTC
TCCTGACCCACCTGCTGGTGTCAGCCCAGACCTTCATAAGTTCTTATTATGGAATAAACCTGGCG
TCAGCATTTATAATCCTGGTGCTCATGGGCACCTGGGCATTCTTAGCTGCGGGAGGCAGCTGCCG
AAGCCTGAAACTCTGCCTGCTCTGCCAAGACAAGAACTTTCTTCTTTACAACCAGCGCTCCAGAT
AACC TCAGGGAACCAGCACT T CCCAAACCGCAGACTACAT CT T TAGAGGAAGCACAAC TGTGCC T
TTTTCTGAAAATCCCTTTTTCTGGTGGAATTGAGAAAGAAATAAAACTATGCAGATA
FIGURE 94



MTAAVFFGCAFIAFGPALALYVFTIAIEPLRIIFLIAGAFFWLVSLLISSLVWFMARVIIDNKDG
PTQKYLLIFGAFVSVYIQEMFRFAYYKLLKKASEGLKSINPGETAPSMRLLAYVSGLGFGIMSGV 
FSFVNTLSDSLGPGTVGIHGDSPQFFLYSAFMTLVIILLHVFWGIVFFDGCEKKKWGILLIVLLT 
HLLVSAQTFISSYYGINLASAFIILVLMGTWAFLAAGGSCRSLKLCLLCQDKNFLLYNQRSR

Important features of the protein: 
Signal peptide: 

amino acids 1-19 

Transmembrane domains: 

amino acids 32-51, 119-138, 152-169, 216-235 

Glycosaminoglycan attachment site. 

amino acids 120-123 

Sodium neurotransmitter symporter family protein 

amino acids 31-65 
FIGURE 95



AATTTTTCACCAGAGTAAACTTGAGAAACCAACTGGACCTTGAGTATTGTACATTTTGCCTCGTG 
GACCCAAAGG T AGCAAT CT GAAACATGAGGAG TACGAT T CTAC TGT T T TGTCT TCTAGGAT CAAC 
TCGGTCATTACCACAGCTCAAACCTGCTTTGGGACTCCCTCCCACAAAACTGGCTCCGGATCAGG 
GAACACTACCAAACCAACAGCAGTCAAATCAGGT C T T T CCT T C TT TAAGTCT GATAC CAT TAACA 
CAGATGCTCACACTGGGGCCAGATCTGCATCTGTTAAATCCTGCTGCAGGAATGACACCTGGTAC 
CCAGACC CACCCAT TGACCC T GGGAGGGT TGAAT G TACAACAGCAAC TGCACCCACAT GT GT TAC 
CAAT TT T TGT CACACAAC T TGGAGCCCAGGGCACTATC C TAAGCT CAGAGGAAT TGCCACAAAT C 
TTCACGAGCCTCATCATCCATTCCTTGTTCCCGGGAGGCATCCTGCCCACCAGTCAGGCAGGGGC 
TAATCCAGATGT CCAGGATGGAAGCCT TCCAGCAGGAGGAGCAGGTGTAAAT CCTGC CACC CAGG 
GAACCCCAGCAGGCCGCCTCCCAACTCCCAGTGGCACAGATGACGACTTTGCAGTGACCACCCCT 
GCAGGCATCCAAAGGAGCACACATGCCATCGAGGAAGCCACCACAGAATCAGCAAATGGAATTCA 
GTAAGCTGTTTCAAATTTTTTCAACTAAGCTGCCTCGAATTTGGTGATACATGTGAATCTTTATC 
AT TGAT TAT AT TAT GGAATAGATTGAGACACATT GGATAGTCTTAGAAGAAAT TAAT T CT TAAT T 
TACCTGAAAATAT T CT TGAAAT T TCAGAAAATAT GT TC TATGTAGAGAAT CCCAACT T T TAAAAA 
CAATAAT TCAAT GGAT AAAT CTGTCTTT GAAATATAACAT TATGC TGCC T GGAT GATATGCATAT 
T AAAAC ATAT T T G G AAAAC T G GAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
FIGURE 96



MRSTILLFCLLGSTRSLPQLKPALGLPPTKLAPDQGTLPNQQQSNQVFPSLSLIPLTQM 
LTLGPDLHLLNPAAGMTPGTQTHPLTLGGLNVQQQLHPHVLPIFVTQLGAQGTILSSEE
LPQIFTSLIIHSLFPGGILPTSQAGANPDVQDGSLPAGGAGVNPATQGTPAGRLPTPSG
TDDDFAVTTPAGIQRSTHAIEEATTESANGIQ

Signal peptide: 

amino acids 1-16 
FIGURE 97



GCTCAAGTGCCCTGCCTTGCCCCACCCAGCCCAGCCTGGCCAGAGCCCCCTGGAGAAGGAGCTCT 
CTTCTTGCTTGGCAGCTGGACCAAGGGAGCCAGTCTTGGGCGCTGGAGGGCCTGTCCTGACC ATG 
GTCCCTGCCTGGCTGTGGCTGCTTTGTGTCTCCGTCCCCCAGGCTCTCCCCAAGGCCCAGCCTGC 
AGAGCTGTCTGTGGAAGTTCCAGAAAACTATGGTGGAAATTTCCCTTTATACCTGACCAAGTTGC 
CGCTGCCCCGTGAGGGGGCTGAAGGCCAGATCGTGCTGTCAGGGGACTCAGGCAAGGCAACTGAG 
GGCCCATTTGCTATGGATCCAGATTCTGGCTTCCTGCTGGTGACCAGGGCCCTGGACCGAGAGGA 
GCAGGCAGAGTACCAGCT ACAGGTCACCC TGGAGAT GCAGGATGGACATGTC T T GT GGGGT CCAC 
AGCCTGTGCTTGTGCACGTGAAGGATGAGAATGACCAGGTGCCCCATTTCTCTCAAGCCATCTAC 
AGAGCTCGGCTGAGCCGGGGTACCAGGCCTGGCATCCCCTTCCTCTTCCTTGAGGCTTCAGACCG 
GGAT GAGCCAGGCACAGCCAAC TCGGAT C T TCGAT T CCACAT C CT GAGCCAGGCT CCAGCCCAGC 
CTTCCCCAGACATGTTCCAGCTGGAGCCTCGGCTGGGGGCTCTGGCCCTCAGCCCCAAGGGGAGC 
ACCAGCC TT GACCACGCCCT GGAGAGGACCTACCAGCT GT T GGTACAGGTCAAGGACAT GGGT GA 
CCAGGCCTCAGGCCACCAGGCCACTGCCACCGTGGAAGTCTCCATCATAGAGAGCACCTGGGTGT 
CCCTAGAGCCTATCCACCTGGCAGAGAATCTCAAAGTCCTATACCCGCACCACATGGCCCAGGTA 
CACTGGAGTGGGGGTGATGTGCACTATCACCTGGAGAGCCATCCCCCGGGACCCTTTGAAGTGAA 
TGCAGAGGGAAACCTCTACGTGACCAGAGAGCTGGACAGAGAAGCCCAGGCTGAGTACCTGCTCC 
AGGTGCGGGCTCAGAATTCCCATGGCGAGGACTATGCGGCCCCTCTGGAGCTGCACGTGCTGGTG 
ATGGATGAGAATGACAACGTGCCTATCTGCCCTCCCCGTGACCCCACAGTCAGCATCCCTGAGCT 
CAGTCCACCAGGTAC TGAAGT GACTAGAC TGT CAGCAGAGGAT GCAGATGCCCCCGGCTCCCCCA 
ATTCCCACGTTGTGTATCAGCTCCTGAGCCCTGAGCCTGAGGATGGGGTAGAGGGGAGAGCCTTC 
CAGGTGGACCCCACTTCAGGCAGTGTGACGCTGGGGGTGCTCCCACTCCGAGCAGGCCAGAACAT 
CCTGCTTCTGGTGCTGGCCATGGACCTGGCAGGCGCAGAGGGTGGCTTCAGCAGCACGTGTGAAG 
TCGAAGTCGCAGTCACAGATATCAATGATCACGCCCCTGAGTTCATCACTTCCCAGATTGGGCCT 
ATAAGCCTCCCTGAGGATGTGGAGCCCGGGACTCTGGTGGCCATGCTAACAGCCATTGATGCTGA 
CCTCGAGCCCGCCTTCCGCCTCATGGATTTTGCCATTGAGAGGGGAGACACAGAAGGGACTTTTG 
GCCTGGATTGGGAGCCAGACTCTGGGCATGTTAGACTCAGACTCTGCAAGAACCTCAGTTATGAG 
GCAGCTCCAAGTCATGAGGTGGTGGTGGTGGTGCAGAGTGTGGCGAAGCTGGTGGGGCCAGGCCC 
AGGCCCTGGAGCCACCGCCACGGTGACTGTGCTAGTGGAGAGAGTGATGCCACCCCCCAAGTTGG 
ACCAGGAGAGCTACGAGGCCAGTGTCCCCATCAGTGCCCCAGCCGGCTCTTTCCTGCTGACCATC 
CAGCCCTCCGACCCCATCAGCCGAACCCTCAGGTTCTCCCTAGTCAATGACTCAGAGGGCTGGCT 
CTGCATTGAGAAATTCTCCGGGGAGGTGCACACCGCCCAGTCCCTGCAGGGCGCCCAGCCTGGGG 
ACACCTACACGGTGCTTGTGGAGGCCCAGGATACAGCCCTGACTCTTGCCCCTGTGCCCTCCCAA 
TACCTCTGCACACCCCGCCAAGACCATGGCTTGATCGTGAGTGGACCCAGCAAGGACCCCGATCT 
GGCCAGTGGGCACGGTCCCTACAGCTTCACCCTTGGTCCCAACCCCACGGTGCAACGGGATTGGC 
GCCTCCAGACTCTCAATGGTTCCCATGCCTACCTCACCTTGGCCCTGCATTGGGTGGAGCCACGT 
GAACACATAATCCCCGTGGTGGTCAGCCACAATGCCCAGATGTGGCAGCTCCTGGTTCGAGTGAT 
CGTGTGTCGCTGCAACGTGGAGGGGCAGTGCATGCGCAAGGTGGGCCGCATGAAGGGCATGCCCA 
CGAAGCTGTCGGCAGTGGGCATCCTTGTAGGCACCCTGGTAGCAATAGGAATCTTCCTCATCCTC 
AT T T T CAC C C AC T G GAC CAT G T C AAGGAAG AAG GAC C C G GAT C AAC C AGC AGAC AG CGTGCCCCT 
GAAGGCGACTGTCTGAATGGCCCAGGCAGCTCTAGCTGGGAGCTTGGCCTCTGGCTCCATCTGAG 
TCCCCTGGGAGAGAGCCCAGCACCCAAGATCCAGCAGGGGACAGGACAGAGTAGAAGCCCCTCCA 
TCT GCCC TGGGG T GGAGGCACCATCACCATCACCAGGCAT GT C TGCAGAGCC TGGACACCAACTT 
TATGGACTGCCCATGGGAGTGCTCCAAATGTCAGGGTGTTTGCCCAATAATAAAGCCCCAGAGAA 
CTGGGCTGGGCCCTATGGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAG 
FIGURE 98



MVPAWLWLLCVSVPQALPKAQPAELSVEVPENYGGNFPLYLTKLPLPREGAEGQIVLSGDSGKAT 
EGPFAMDPDSGFLLVTRALDREEQAEYQLQVTLEMQDGHVLWGPQPVLVHVKDENDQVPHFSQAI 
YRARLSRGTRPGIPFLFLEASDRDEPGTANSDLRFHILSQAPAQPSPDMFQLEPRLGALALSPKG 
STSLDHALERTYQLLVQVKDMGDQASGHQATATVEVSIIESTWVSLEPIHLAENLKVLYPHHMAQ
VHWSGGDVHYHLESHPPGPFEVNAEGNLYVTRELDREAQAEYLLQVRAQNSHGEDYAAPLELHVL
VMDENDNVPICPPRDPTVSIPELSPPGTEVTRLSAEDADAPGSPNSHVVYQLLSPEPEDGVEGRA
FQVDPTSGSVTLGVLPLRAGQNILLLVLAMDLAGAEGGFSSTCEVEVAVTDINDHAPEFITSQIG
PISLPEDVEPGTLVAMLTAIDADLEPAFRLMDFAIERGDTEGTFGLDWEPDSGHVRLRLCKNLSY
EAAPSHEVVVVVQSVAKLVGPGPGPGATATVTVLVERVMPPPKLDQESYEASVPISAPAGSFLLT
IQPSDPISRTLRFSLVNDSEGWLCIEKFSGEVHTAQSLQGAQPGDTYTVLVEAQDTALTLAPVPS
QYLCTPRQDHGLIVSGPSKDPDLASGHGPYSFTLGPNPTVQRDWRLQTLNGSHAYLTLALHWVEP
REHIIPVVVSHNAQMWQLLVRVIVCRCNVEGQCMRKVGRMKGMPTKLSAVGILVGTLVAIGIFLI
LIFTHWTMSRKKDPDQPADSVPLKATV




Signal peptide:

amino acids 1-18

Transmembrane domain:

amino acids 762-784
FIGURE 99



GGCTGACCGTGCTACATTGCCTGGAGGAAGCCTAAGGAACCCAGGCATCCAGCTGCCCACGCCTG 
AGTCCAAGATTCTTCCCAGGAACACAAACGTAGGAGACCCACGCTCCTGGAAGCACCAGCCTTTA 
TCTCTTCACCTTCAAGTCCCCTTTCTCAAGAATCCTCTGTTCTTTGCCCTCTAAAGTCTTGGTAC 
ATCTAGGACCCAGGCATCTT GCTT TCCAGCCACAAAGAGACAG ATGA AGATGCAGAAAGGAAATG 
TTCTCCTTATGTTTGGTCTACTATTGCATTTAGAAGCTGCAACAAATTCCAATGAGACTAGCACC 
TCTGCCAACACTGGATCCAGTGTGATCTCCAGTGGAGCCAGCACAGCCACCAACTCTGGGTCCAG 
TGTGACCTCCAGTGGGGTCAGCACAGCCACCATCTCAGGGTCCAGCGTGACCTCCAATGGGGTCA 
GCATAGT CACCAACTCT GAGT T CCATACAACC TCCAGT GGGATCAGCACAGCCACCAACT CT GAG 
TTCAGCACAGCGTCCAGTGGGATCAGCATAGCCACCAACTCTGAGTCCAGCACAACCTCCAGTGG 
GGCCAGCACAGCCACCAACTCTGAGTCCAGCACACCCTCCAGTGGGGCCAGCACAGTCACCAACT 
CTGGGTCCAGTGTGACCTCCAGTGGAGCCAGCACTGCCACCAACTCTGAGTCCAGCACAGTGTCC 
AGTAGGGCCAGCAC TGC CACCAAC TCTGAGT C T AGCACAC TC TCCAGTGGGGCCAGCACAGCCAC 
CAACTCTGACTCCAGCACAACCTCCAGTGGGGCTAGCACAGCCACCAACTCTGAGTCCAGCACAA 
CCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAGCACAGTGTCCAGTAGGGCCAGCACT 
GCCACCAAC T CT GAGT C CAGCACAACCTCCAG T GGGGCCAGCACAGCCACCAACTC T GAGT CCAG 
AACGACCTCCAATGGGGCTGGCACAGCCACCAACTCTGAGTCCAGCACGACCTCCAGTGGGGCCA 
GCACAGCCACCAAC TCT GACT CCAGCACAGTGT CCAGT GGGGCCAGCAC TGCCACCAACT C TGAG 
TCCAGCACGACCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAGCACGACCTCCAGTGG 
GGC TAG C ACAG CCACCAAC TCT GAC T C C AGCAC AAC C T C C AG T GGGGCC GGCACAG C CAC CAAC T 
CTGAGTCCAGCACAGTGTCCAGTGGGATCAGCACAGTCACCAATTCTGAGTCCAGCACACCCTCC 
AGTGGGGCCAACACAGC CACCAAC T C TGAGT CCAGTACGACCTCCAGTGGGGCCAACACAGCCAC 
CAACTCTGAGTCCAGCACAGTGTCCAGTGGGGCCAGCACTGCCACCAACTCTGAGTCCAGCACAA 
CCTCCAGTGGGGTCAGCACAGCCACCAACTCTGAGTCCAGCACAACCTCCAGTGGGGCTAGCACA 
GCCACCAAC T C TGAC T CCAGCACAACC TC CAGTGAGGCCAGCACAGCCACCAACTCT GAGT CTAG 
CACAGTGTCCAGTGGGATCAGCACAGTCACCAATTCTGAGTCCAGCACAACCTCCAGTGGGGCCA 
ACACAGCCACCAACTCTGGGTCCAGTGTGACCTCTGCAGGCTCTGGAACAGCAGCTCTGACTGGA 
ATGCACACAACTTCCCATAGTGCATCTACTGCAGTGAGTGAGGCAAAGCCTGGTGGGTCCCTGGT 
GCCGTGGGAAATCTTCCTCATCACCCTGGTCTCGGTTGTGGCGGCCGTGGGGCTCTTTGCTGGGC 
TCTTCTTCTGTGTGAGAAACAGCCTGTCCCTGAGAAACACCTTTAACACAGCTGTCTACCACCCT 
CATGGCCTCAACCATGGCCTTGGTCCAGGCCCTGGAGGGAATCATGGAGCCCCCCACAGGCCCAG 
GT GGAGT CC TAACT GGT T CT GGAGGAGACCAGTAT CAT C GATAGCCATGGAGAT GAGCGGGAGGA 
ACAGCGGGCCCTGAGCAGCCCCGGAAGCAAGTGCCGCATTCTTCAGGAAGGAAGAGACCTGGGCA 
CCCAAGACCTGGTTTCCTTTCATTCATCCCAGGAGACCCCTCCCAGCTTTGTTTGAGATCCTGAA 
AATC T T GAAGAAGGTATT CCTCACC TT T C TTGCC T T TACCAGACAC TGGAAAGAGAATAC TATAT 
TGCTCATTTAGCTAAGAAATAAATACATCTCATCTAACACACACGACAAAGAGAAGCTGTGCTTG 
CCCCGGGGTGGGTATCTAGCTCTGAGATGAACTCAGTTATAGGAGAAAACCTCCATGCTGGACTC 
CATC TGGCAT T CAAAAT CTC CACAGTAAAATC CAAAGACCT CAAAAMAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAA&AAAAAAAA 
FIGURE 100

MKMQKGNVLLMFGLLLHLEAATNSNETSTSANTGSSVISSGASTATNSGSSVTSSGVSTATISGS
SVTSNGVSIVTNSEFHTTSSGISTATNSEFSTASSGISIATNSESSTTSSGASTATNSESSTPSS 
GASTVTNSGSSVTSSGASTATNSESSTVSSRASTATNSESSTLSSGASTATNSDSSTTSSGASTA 
TNSESSTTSSGASTATNSESSTVSSRASTATNSESSTTSSGASTATNSESRTTSNGAGTATNSES 
STTSSGASTATNSDSSTVSSGASTATNSESSTTSSGASTATNSESSTTSSGASTATNSDSSTTSS 
GAGTATNSESSTVSSGISTVTNSESSTPSSGANTATNSESSTTSSGANTATNSESSTVSSGASTA 
TNSESSTTSSGVSTATNSESSTTSSGASTATNSDSSTTSSEASTATNSESSTVSSGISTVTNSES 
STTSSGANTATNSGSSVTSAGSGTAALTGMHTTSHSASTAVSEAKPGGSLVPWEIFLITLVSVVA
AVGLFAGLFFCVRNSLSLRNTFNTAVYHPHGLNHGLGPGPGGNHGAPHRPRWSPNWFWRRPVSSI
AMEMSGRNSGP 

Signal peptide: 

amino acids 1-20 

Transmembrane domain: 

amino acids 510-532 






FIGURE 101 



GGCCGGACGCCTCCGCGTTACGGGATGAATTAACGGCGGGTTCCGCACGGAGGTTGTGACCCCTA 
CGGAGCCCCAGCTTGCCCACGCACCCCACTCGGCGTCGCGCGGCGTGCCCTGCTTGTCACAGGTG 
GGAGGCTGGAACTATCAGGCTGAAAAACAGAGTGGGTACTCTCTTCTGGGAAGCTGGCAACAAAT 
GGATGAT GTGAT ATATGCAT TCCAGGGGAAGGGAAATT GTGGT GCT TCT GAACCCAT GGT CAAT T 
AACGAGGCAGTT TCTAGCTACTGCACGTACT TCATAAAGCAGGACTCTAAAAGCTT TGGAATCAT 
GGTGTCATGGAAAGGGATTTACTTTATACTGACTCTGTTTTGGGGAAGCTTTTTTGGAAGCATTT 
TCATGCTGAGTCCCTTTTTACCTTTGATGTTTGTAAACCCATCTTGGTATCGCTGGATCAACAAC 
CGCCTTGTGGCAACATGGCTCACCCTACCTGTGGCATTATTGGAGACCATGTTTGGTGTAAAAGT 
GAT TATAAC TGGGGAT GCAT T TGT TCC TGGAGAAAGAAGT GT CAT TATCATGAACCATCGGACAA 
GAATGGACTGGATGTTCCTGTGGAATTGCCTGATGCGATATAGCTACCTCAGATTGGAGAAAATT 
TGCCTCAAAGCGAGTCTCAAAGGTGTTCCTGGATTTGGTTGGGCCATGCAGGCTGCTGCCTATAT 
C TTCAT TCATAGGAAAT GGAAGGATGACAAGAGCCATT T CGAAGACAT GATTGAT TACT T T TGTG 
ATATTCACGAACCACT T CAAC TCCT CATATTCCCAGAAGGGAC TGATCTCACAGAAAACAGCAAG 
TC T CGAAGTAAT GCAT T T GCT GAAAAAAATGGAC TT CAGAAATATGAATATGTTT TACATCCAAG 
AACTACAGGCTTTACTTTTGTGGTAGACCGTCTAAGAGAAGGTAAGAACCTTGATGCTGTCCATG
ATATCACTGTGGCGTATCCTCACAACATTCCTCAATCAGAGAAGCACCTCCTCCAAGGAGACTTT 
CCCAGGGAAATCCACTTTCACGTCCACCGGTATCCAATAGACACCCTCCCCACATCCAAGGAGGA 
CCTTCAACTCTGGTGCCACAAACGGTGGGAAGAGAAAGAAGAGAGGCTGCGTTCCTTCTATCAAG

GGGAGAAGAAT T T T TAT T T TACCGGACAGAGT GT CATT CCACC T TGCAAGTC T GAACTCAGGGTC 
CTTGTGGTCAAATTGCTCTCTATACTGTATTGGACCCTGTTCAGCCCTGCAATGTGCCTACTCAT 
ATATTTGTACAGTCTTGTTAAGTGGTATTTTATAATCACCATTGTAATCTTTGTGCTGCAAGAGA 
GAATAT T TGG TGGACT GGAGATCATAGAAC T TGCAT GT TACCGACT TT TACACAAACAGCCACAT 
T TAAATTCAAAGAAAAAT GAGTAAGAT TATAAGG T T TGCCAT GTGAAAACC TAGAGCATAT T TTG 
GAAATGTTCTAAACCTTTCTAAGCTCAGATGCATTTTTGCATGACTATGTCGAATATTTCTTACT 
GCCATCATTAT T T GT TAAAGATAT T TT GCAC T T AAT TT TGTGGGAAAAATAT TGC TACAAT T TT T 
T T T AAT C TCT GAAT G T AAT T T C GAT AC TGTG T AC AT AG GAG G GAG T GAT C GGGGT GAAAT AAC T T 
GGGCCAGAATAT TAT T AAACAATCATCAGGC T T T TAAA 
FIGURE 102



MHSRGREIVVLLNPWSINEAVSSYCTYFIKQDSKSFGIMVSWKGIYFILTLFWGSFFGSIFMLSP
FLPLMFVNPSWYRWINNRLVATWLTLPVALLETMFGVKVIITGDAFVPGERSVIIMNHRTRMDWM
FLWNCLMRYSYLRLEKICLKASLKGVPGFGWAMQAAAYIFIHRKWKDDKSHFEDMIDYFCDIHEP
LQLLIFPEGTDLTENSKSRSNAFAEKNGLQKYEYVLHPRTTGFTFVVDRLREGKNLDAVHDITVA
YPHNIPQSEKHLLQGDFPREIHFHVHRYPIDTLPTSKEDLQLWCHKRWEEKEERLRSFYQGEKNF
YFTGQSVIPPCKSELRVLVVKLLSILYWTLFSPAMCLLIYLYSLVKWYFIITIVIFVLQERIFGG
LEIIELACYRLLHKQPHLNSKKNE

Important features of the protein: 
Signal peptide: 

amino acids 1-22 

Transmembrane domains: 

amino acids 44-63, 90-108, 354-377 
FIGURE 103



CGGCTCGAGCGGCTCGAGTGAAGAGCCTCTCCACGGCTCCTGCGCCTGAGACAGCTGGCCTGACC 
TCCAAAT CATCCAT CCACCCC TGC T GT CAT C T GT T T T CATAGT GTGAGATCAACCCACAGGAATA 
TCCATGGCTTTTGTGCTCATTTTGGTTCTCAGTTTCTACGAGCTGGTGTCAGGACAGTGGCAAGT 
CACTGGACCGGGCAAGTTTGTCCAGGCCTTGGTGGGGGAGGACGCCGTGTTCTCCTGCTCCCTCT 
TTCCTGAGACCAGTGCAGAGGCTATGGAAGTGCGGTTCTTCAGGAATCAGTTCCATGCTGTGGTC 
CACCT CTACAGAGAT GGGGAAGAC TGGGAAT CTAAGCAGAT GCCACAGTATC GAGGGAGAAC T GA 
GTTTGTGAAGGACTCCATTGCAGGGGGGCGTGTCTCTCTAAGGCTAAAAAACATCACTCCCTCGG 
ACATCGGCCTGTATGGGTGCTGGTTCAGTTCCCAGATTTACGATGAGGAGGCCACCTGGGAGCTG 
CGGGTGGCAGCACTGGGCTCACTTCCTCTCATTTCCATCGTGGGATATGTTGACGGAGGTATCCA 
GTTACTCTGCCTGTCCTCAGGCTGGTTCCCCCAGCCCACAGCCAAGTGGAAAGGTCCACAAGGAC
AGGATTTGTCTTCAGACTCCAGAGCAAATGCAGATGGGTACAGCCTGTATGATGTGGAGATCTCC
ATTATAGTCCAGGAAAATGCTGGGAGCATATTGTGTTCCATCCACCTTGCTGAGCAGAGTCATGA
GGTGGAATCCAAGGTATTGATAGGAGAGACGTTTTTCCAGCCCTCACCTTGGCGCCTGGCTTCTA
TTTTACTCGGGTTACTCTGTGGTGCCCTGTGTGGTGTTGTCATGGGGATGATAATTGTTTTCTTC
AAATCCAAAGGGAAAATCCAGGCGGAACTGGACTGGAGAAGAAAGCACGGACAGGCAGAATTGAG
AGACGCCCGGAAACACGCAGTGGAGGTGACTCTGGATCCAGAGACGGCTCACCCGAAGCTCTGCG
TTTCTGATCTGAAAACTGTAACCCATAGAAAAGCTCCCCAGGAGGTGCCTCACTCTGAGAAGAGA
TTTACAAGGAAGAGTGTGGTGGCTTCTCAGGGTTTCCAAGCAGGGAGACATTACTGGGAGGTGGA
CGTGGGACAAAATGTAGGGTGGTATGTGGGAGTGTGTCGGGATGACGTAGACAGGGGGAAGAACA
ATGTGACTTTGTCTCCCAACAATGGGTATTGGGTCCTCAGACTGACAACAGAACATTTGTATTTC
ACATTCAATCCCCATTTTATCAGCCTCCCCCCCAGCACCCCTCCTACACGAGTAGGGGTCTTCCT
GGACTATGAGGGTGGGACCATCTCCTTCTTCAATACAAATGACCAGTCCCTTATTTATACCCTGC
TGACATGTCAGTTTGAAGGCTTGTTGAGACCCTATATCCAGCATGCGATGTATGACGAGGAAAAG
GGGACTCCCATATTCATATGTCCAGTGTCCTGGGGATGAGACAGAGAAGACCCTGCTTAAAGGGC
CCCACACCACAGACCCAGACACAGCCAAGGGAGAGTGCTCCCGACAGGTGGCCCCAGCTTCCTCT
CCGGAGCCTGCGCACAGAGAGTCACGCCCCCCACTCTCCTTTAGGGAGCTGAGGTTCTTCTGCCC
TGAGCCCTGCAGCAGCGGCAGTCACAGCTTCCAGATGAGGGGGGATTGGCCTGACCCTGTGGGAG
TCAGAAGCCATGGCTGCCCTGAAGTGGGGACGGAATAGACTCACATTAGGTTTAGTTTGTGAAAA
CTCCATCCAGCTAAGCGATCTTGAACAAGTCACAACCTCCCAGGCTCCTCATTTGCTAGTCACGG 
ACAGTGAT TCCT GC CT CACAGGTGAAGAT TAAAGAGACAACGAAT GTGAATCAT GCT TGCAGG T T 
T GAGGGCACAGT GT TT GCT AATGAT GTGT T T T TATATTATACATT T TCC CACCATAAAC T CTG T T 
TGCTTATTCCACAT TAATTTACTT TTCTCTATACCAAAT CACCCATGGAATAGT TAT TGAACACC 
TGCT T TGTGAGGC T CAAAGAATAAAGAGGAGGTAGGAT T TT TCAC TGAT T CTATAAGCC CAGCAT 
TACCTGATACCAAAACCAGGCARAGAAAACAGAAGAAGAGGAAGGAAAACTACAGGTCCATATCC 
C T CAT T AACACAGAC AC AAAAAT T C T AAAT AAAAT T T T AAC AAAT TAAAC TAAAC AAT AT AT T T A 
AAGATGATATATAACTACTCAGTGTGGT TTGTCCCACAAAT GCAGAGT TGGT TTAATAT TTAAAT 
ATCAACCAGTGTAATTCAGCACATTAATAAAGTAAAAAAGAAAACCATAAAAAAAAAAAAAAA 
FIGURE 104



MAFVLILVLSFYELVSGQWQVTGPGKFVQALVGEDAVFSCSLFPETSAEAMEVRFFRNQFHAVVH
LYRDGEDWESKQMPQYRGRTEFVKDSIAGGRVSLRLKNITPSDIGLYGCWFSSQIYDEEATWELR
VAALGSLPLISIVGYVDGGIQLLCLSSGWFPQPTAKWKGPQGQDLSSDSRANADGYSLYDVEISI
IVQENAGSILCSIHLAEQSHEVESKVLIGETFFQPSPWRLASILLGLLCGALCGVVMGMIIVFFK
SKGKIQAELDWRRKHGQAELRDARKHAVEVTLDPETAHPKLCVSDLKTVTHRKAPQEVPHSEKRF
TRKSVVASQGFQAGRHYWEVDVGQNVGWYVGVCRDDVDRGKNNVTLSPNNGYWVLRLTTEHLYFT
FNPHFISLPPSTPPTRVGVFLDYEGGTISFFNTNDQSLIYTLLTCQFEGLLRPYIQHAMYDEEKG
TPIFICPVSWG

Signal peptide: 

amino acids 1-17 

Transmembrane domains: 

amino acids 131-150, 235-259 
FIGURE 105



CCTTCACAGGACTCTTCATTGCTGGTTGGCAATGATGTATCGGCCAGATGTGGTGAGGGCTAGGAAAAGAG 

TTTGTTGGGAACCCTGGGTTATCGGCCTCGTCATCTTCATATCCCTGATTGTCCTGGCAGTGTGCATTGGA 

CT C ACT GT TCAT T ATGT G AGAT AT AATC AAAAGAAGACCT AC AATT ACTAT AGC AC ATTGTC AT TT ACAAC 

TG ACAAAC T AT ATG CT GAGT T T GGC AGAGAGGCT TCTAAC AATTTT AC AG AAATGAGCCAGAGACTTG AAT 

CAATGGTGAAAAAT GC ATT T TATAAATC TCCATT AAGGGAAGAATT T GTC AAGTCT C AGGTT ATCAAG TT C 

AGTCAACAGAAGCATGGAGTGTTGGCTCATATGCTGTTGATTTGTAGATTTCACTCTACTGAGGATCCTGA 

AACTGTAGATAAAATTGTTCAACTTGTTTTACATGAAAAGCTGCAAGATGCTGTAGGACCCCCTAAAGTAG 

AT CCTCAC TC AG TT AAAATT AAAAAAAT CAAC AAGACAGAAAC AGACAGCTATCT AAACCAT TGCTGCGGA 

ACACGAAGAAGTAAAACTCTAGGTCAGAGTCTCAGGATCGTTGGTGGGACAGAAGTAGAAGAGGGTGAATG 

GCCCTGGCAGGCTAGCCTGCAGTGGGATGGGAGTCATCGCTGTGGAGCAACCTTAATTAATGCCACATGGC 

TT GTGAGT GCT GCTCACT GT TT TACAAC AT AT AAGAACCCTGCC AG ATGGACTGCT TCCTT T GG AGTAAC A 

ATAAAACCTTCGAAAATGAAACGGGGTCTCCGGAGAATAATTGTCCATGAAAAATACAAACACCCATCACA

TGACTATGATATTTCTCTTGCAGAGCTTTCTAGCCCTGTTCCCTACACAAATGCAGTACATAGAGTTTGTC

TCCCTGATGCATCCTATGAGTTTCAACCAGGTGATGTGATGTTTGTGACAGGATTTGGAGCACTGAAAAAT

GATGGTTACAGTCAAAATCATCTTCGACAAGCACAGGTGACTCTCATAGACGCTACAACTTGCAATGAACC

TC AAGCTT AC AATG ACGC CAT AAC TCCT AGAATGTT AT GT GCT G GC T CCT T AGAAG GAAAAACAGATGCAT 

GCC AGGGT GACT CT GG AG GACC AC TGGT T AGT TCAGAT GCT AGAGAT ATCTGGT AC CT TGC TGGAATAGT G 

AGCTGGGG AG AT GAAT GTGCGAAACCCAAC AAGCCTGGTGT TT ATACT AG AGT T ACGGCCTT GCGGGACT G 

GATT ACTT C AAAAACT GGT ATC TAAGAGACAAAAGCCT CAT GGAAC AGAT AACATT TT TTTT TGT TTT TT G 

GGTGTGGAGGCCATTTTTAGAGATACAGAATTGGAGAAGACTTGCAAAACAGCTAGATTTGACTGATCTCA

ATAAACTGTTTGCTTGATGCATGTATTTTCTTCCCAGCTCTGTTCCGCACGTAAGCATCCTGCTTCTGCCA 
GAT C AACT CT GT CATCTGTG AGCAAT AGTT GAAACT T TAT GT ACAT AG AGAAAT AGAT AATACAAT AT T AC 
ATTACAGCCTGTATTCATTTGTTCTCTAGAAGTTTTGTCAGAATTTTGACTTGTTGACATAAATTTGTAAT 
GC AT AT AT ACAAT T TGAAGC ACTC CT TT TCT T CAGTT C CT CAGCTC CT CT C ATTTC AGCAAAT ATCC AT T T 
TC AAGGTG CAGAAC AAGGAGTGAAAGAAAAT ATAAG AAGAAAAAAATCC C CT ACAT TT TAT T GGCACAGAA 
AAGT ATTAGGTGT TTTTCT T AGTGGAAT ATT AGAAATG AT CAT ATT CAT TAT GAAAGGTCAAGC AAAG ACA 
GC AG AAT ACC AAT CACTTC ATC ATT T AGGAAGT ATGGGAACT AAGT T AAG GAAGTC CAGAAAGAAGCC AAG 
AT AT AT CC TT AT TT TC ATT T CC AAAC AACT AC T ATGAT AAAT GTGAAG AAGATTCT GT TTTT TT GT GACCT 
ATAATAATTATACAAACTTCATGCAATGTACTTGTTCTAAGCAAATTAAAGCAAATATTTATTTAACATTG 
TT AC TGAGGAT GT C AACATAT AAC AAT AAAAT AT AAAT CACCC A 
FIGURE 106



MMYRPDVVRARKRVCWEPWVIGLVIFISLIVLAVCIGLTVHYVRYNQKKTYNYYSTLSFTTDKLY
AEFGREASNNFTEMSQRLESMVKNAFYKSPLREEFVKSQVIKFSQQKHGVLAHMLLICRFHSTED
PETVDKIVQLVLHEKLQDAVGPPKVDPHSVKIKKINKTETDSYLNHCCGTRRSKTLGQSLRIVGG 
TEVEEGEWPWQASLQWDGSHRCGATLINATWLVSAAHCFTTYKNPARWTASFGVTIKPSKMKRGL 
RRIIVHEKYKHPSHDYDISLAELSSPVPYTNAVHRVCLPDASYEFQPGDVMFVTGFGALKNDGYS 
QNHLRQAQVTLIDATTCNEPQAYNDAITPRMLCAGSLEGKTDACQGDSGGPLVSSDARDIWYLAG
IVSWGDECAKPNKPGVYTRVTALRDWITSKTGI 

Transmembrane domain: 

amino acids 21-40 (type II) 
FIGURE 107



AGAGAAAGAAGCGTCTCCAGCTGAAGCCAATGCAGCCCTCCGGCTCTCCGCGAAGAAGTTCCCTG
CCCCGATGAGCCCCCGCCGTGCGTCCCCGACTATCCCCAGGCGGGCGTGGGGCACCGGGCCCAGC 
GCCGACGATCGCTGCCGTTTTGCCCTTGGGAGTAGGATGTGGTGAAAGGATGGGGCTTCTCCCTT 
ACGGGGCTCACAATGGCCAGAGAAGATTCCGTGAAGTGTCTGCGCTGCCTGCTCTACGCCCTCAA 
TCTGCTCTTTTGGTTAATGTCCATCAGTGTGTTGGCAGTTTCTGCTTGGATGAGGGACTACCTAA 
ATAATGTTCTCACTTTAACTGCAGAAACGAGGGTAGAGGAAGCAGTCATTTTGACTTACTTTCCT 
GTGGTTCATCCGGTCATGATTGCTGTTTGCTGTTTCCTTATCATTGTGGGGATGTTAGGATATTG 
TGGAACGGTGAAAAGAAATCTGTTGCTTCTTGCATGGTACTTTGGAAGTTTGCTTGTCATTTTCT 
GTGTAGAACTGGCTTGTGGCGTTTGGACATATGAACAGGAACTTATGGTTCCAGTACAATGGTCA 
GATATGGTCACTTTGAAAGCCAGGATGACAAATTATGGATTACCTAGATATCGGTGGCTTACTCA
TGCTTGGAATTTTTTTCAGAGAGAGTTTAAGTGCTGTGGAGTAGTATATTTCACTGACTGGTTGG
AAATGACAGAGATGGACTGGCCCCCAGATTCCTGCTGTGTTAGAGAATTCCCAGGATGTTCCAAA
CAGGCCCACCAGGAAGATCTCAGTGACCTTTATCAAGAGGGTTGTGGGAAGAAAATGTATTCCTT
TTTGAGAGGAACCAAACAACTGCAGGTGCTGAGGTTTCTGGGAATCTCCATTGGGGTGACACAAA
TCCTGGCCATGATTCTCACCATTACTCTGCTCTGGGCTCTGTATTATGATAGAAGGGAGCCTGGG
ACAGACCAAATGATGTCCTTGAAGAATGACAACTCTCAGCACCTGTCATGTCCCTCAGTAGAACT
GTTGAAACCAAGCCTGTCAAGAATCTTTGAACACACATCCATGGCAAACAGCTTTAATACACACT
TTGAGATGGAGGAGTTATAAAAAGAAATGTCACAGAAGAAAACCACAAACTTGTTTTATTGGACT
TGTGAATTTTTGAGTACATACTATGTGTTTCAGAAATATGTAGAAATAAAAATGTTGCCATAAAA
TAACACCTAAGCATATACTATTCTATGCTTTAAAATGAGGATGGAAAAGTTTCATGTCATAAGTC
ACCACCTGGACAATAATTGATGCCCTTAAAATGCTGAAGACAGATGTCATACCCACTGTGTAGCC
TGTGTATGACTTTTACTGAACACAGTTATGTTTTGAGGCAGCATGGTTTGATTAGCATTTCCGCA
TCCATGCAAACGAGTCACATATGGTGGGACTGGAGCCATAGTAAAGGTTGATTTACTTCTACCAA
CTAGTATATAAAGTACTAATTAAATGCTAACATAGGAAGTTAGAAAATACTAATAACTTTTATTA
CTCAGCGATCTATTCTTCTGATGCTAAATAAATTATATATCAGAAAACTTTCAATATTGGTGACT
ACCTAAATGTGATTTTTGCTGGTTACTAAAATATTCTTACCACTTAAAAGAGCAAGCTAACACAT
TGTCTTAAGCTGATCAGGGATTTTTTGTATATAAGTCTGTGTTAAATCTGTATAATTCAGTCGAT
TTCAGT TCTGATAATGTTAAGAATAACCAT TATGAAAAGGAAAAT T TGTCCTGTATAGCATCAT T 
AT T T T TAGCC T T TCCT GT TAATAAAGCT TTAC TAT T CT GTCC T GGGCT TATAT TACACATATAAC 
TGT TAT T TAAATACTT AACCAC TAAT T T TGAAAATTACCAGTGTGATACATAGGAAT CAT T AT TC 
AGAAT GTAGT CT GGTCT T TAGGAAGTAT TAAT AAGAAAATT TGCACATAACT TAGT T GATT CAGA 
AAGGACTTGTATGCTGTTTTTCTCCCAAATGAAGACTCTTTTTGACACTAAACACTTTTTAAAAA 
GC TTAT C TT T GCC T TCT CCAAACAAGAAGCAATAGT CT CCTiAGTCAATATAAAT TC TACAGAAAA 
TAGTGTTCTTTTTCTCCAGAAAAATGCTTGTGAGAATCATTAAAACATGTGACAATTTAGAGATT 
CTTTGTTT TAT T T CACT GAT T AATAT AC TGTGGCAAAT T AC AC AGAT TAT TAAAT T T T T T T ACAA 
GAGTATAGTATAT T TAT T TGAAATGGGAAAAGTGCAT T T TACT GTAT T T T GT GT AT T T T GT TTAT 
TT C TCAGAATAT GGAAAGAAAAT TAAAATGTGTCAATAAAT AT T TT C TAGAGAGTAA 
FIGURE 108



MAREDSVKCLRCLLYALNLLFWLMSISVLAVSAWMRDYLNNVLTLTAETRVEEAVILTYFPVVHP
VMIAVCCFLIIVGMLGYCGTVKRNLLLLAWYFGSLLVIFCVELACGVWTYEQELMVPVQWSDMVT
LKARMTNYGLPRYRWLTHAWNFFQREFKCCGVVYFTDWLEMTEMDWPPDSCCVREFPGCSKQAHQ
EDLSDLYQEGCGKKMYSFLRGTKQLQVLRFLGISIGVTQILAMILTITLLWALYYDRREPGTDQM
MSLKNDNSQHLSCPSVELLKPSLSRIFEHTSMANSFNTHFEMEEL

Signal peptide: 

amino acids 1-33 

Transmembrane domains: 

amino acids 12-35, 57-86, 94-114, 226-248 
FIGURE 109



CCAAGGCCAGAGCTGTGGACACCTTATCCCACTCATCCTCATCCTCTTCCTCTGATAAAGCCCCTACCAGTGCT
GATAAAGTCTTTCTCGTGAGAGCCTAGAGGCCTTAAAAAAAAAAGTGCTTGAAAGAGAAGGGGACAAAGGAACA 
CCAGTATTAAGAGGATTTTCCAGTGTTTCTGGCAGTTGGTCCAGAAGGATGCCTCCATTCCTGCTTCTCACCTG 
CCTCTTCATCACAGGCACCTCCGTGTCACCC GTGGCCCTAGATCCTTGTTCTGCTTACATCAGCCTGAATGAGC 
CCT GGAGGAACACTGACCACCAGTTGGAT GAGTCTCAAGGTCCTCCT CTATGT GACAACCAT GT GAAT GGGGAG 
T GGT ACC ACT T C AC G GGC AT GGC GGGAGAT GCC AT GC CT ACCT T CT GC AT AC C AGAAAACC ACT GT GGAACC CA 
CGCACCTGTCTGGCTCAATGGCAGCCACCCCCTAGAAGGCGACGGCATTGTGCAACGCCAGGCTTGTGCCAGCT 
T CAAT GGGAAC T GCT GT CT C T GGAAC AC C AC GGT G GAAGT C AAGGC T T GC C CT GGAGGC T AC TAT GT GT ATC GT 
CTGACC AAGC C C AGC GT CT GC TT C C AC GT CT AC T GT GGT CAT T T T T AT GAC AT CT GC GAC G AGG ACT GC C AT GG 
CAGCTGCTCAGATACCAGCGAGTGCACATGCGCTCCAGGAACTGTGCTAGGCCCTGACAGGCAGACATGCTTTG
ATGAAAATGAATGTGAGCAAAACAACGGTGGCTGCAGTGAGATCTGTGTGAACCTCAAAAACTCCTACCGCTGT
GAGTGTGGGGTTGGCCGTGTGCTAAGAAGTGATGGCAAGACTTGTGAAGACGTTGAAGGATGCCACAATAACAA
TGGTGGCTGCAGCCACTCTTGCCTTGGATCTGAGAAAGGCTACCAGTGTGAATGTCCCCGGGGCCTGGTGCTGT
CTGAGGATAACCACACTTGCCAAGTCCCTGTGTTGTGCAAATCAAATGCCATTGAAGTGAACATCCCCAGGGAG
CTGGTTGGTGGCCTGGAGCTCTTCCTGACCAACACCTCCTGCCGAGGAGTGTCCAACGGCACCCATGTCAACAT
CCTCTTCTCTCTCAAGACATGTGGTACAGTGGTCGATGTGGTGAATGACAAGATTGTGGCCAGCAACCTCGTGA
CAGGTCTACCCAAGCAGACCCCGGGGAGCAGCGGGGACTTCATCATCCGAACCAGCAAGCTGCTGATCCCGGTG
ACCTGCGAGTTTCCACGCCTGTACACCATTTCTGAAGGATACGTTCCCAACCTTCGAAACTCCCCACTGGAAAT
CATGAGCCGAAATCATGGGATCTTCCCATTCACTCTGGAGATCTTCAAGGACAATGAGTTTGAAGAGCCTTACC



GGGAAGCTCTGCCCACCCTCAAGCTTCGTGACTCCCTCTACTTTGGCATTGAGCCCGTGGTGCACGTGAGCGGC 



TTGGAAAGCTTGGTGGAGAGCTGCTTTGCCACCCCCACCTCCAAGATCGACGAGGTCCTGAAATACTACCTCAT 
CC GGGAT GGCT GT GT T TC AGATGACT C GGT AAAGC AGT AC AC ATCCC GGGAT C AC C T AGC AAAGCACT T CC AGG 
TCCCTGTCTTCAAGTTTGTGGGCAAAGACCACAAGGAAGTGTTTCTGCACTGCCGGGTTCTTGTCTGTGGAGTG 
TTGGACGAGCGTTCCCGCTGTGCCCAGGGTTGCCACCGGCGAATGCGTCGTGGGGCAGGAGGAGAGGACTCAGC 
C GGT CT AC AGG GC C AGAC GCT AAC AGGC GGC C C GATC CGC AT C GAC T GG GAGGAC TAG T TC GT AGCC AT AC C T C 
GAGTCCCTGCATTGGACGGCTCTGCTCTTTGGAGCTTCTCCCCCCACCGCCCTCTAAGAACATCTGCCAACAGC 
TGGGTTCAGACTTCACACTGTGAGTTCAGACTCCCAGCACCAACTCACTCTGATTCTGGTCCATTCAGTGGGCA 
CAGGTCACAGCACTGCTGAACAATGTGGCCTGGGTGGGGTTTCATCTTTCTAGGGTTGAAAACTAAACTGTCCA 
CCCAGAAAGACACTCACCCCATTTCCCTCATTTCTTTCCTACACTTAAATACCTCGTGTATGGTGCAATCAGAC 
CACAAAATCAGAAGCTGGGTATAATATTTCAAGTTACAAACCCTAGAAAAATTAAACAGTTACTGAAATTATGA 
CTTAAATACCCAATGACTCCTTAAATATGTAAATTATAGTTATACCTTGAAATTTCAATTCAAATGCAGACTAA 
T T AT AGGGAAT T T GG AAGT GT AT C AAT AAAAC AGT AT AT AAT T TT 
FIGURE 110



MPPFLLLTCLFITGTSVSPVALDPCSAYISLNEPWRNTDHQLDESQGPPLCDNHVNGEWYHFTGMAGDAMP
TFCIPENHCGTHAPVWLNGSHPLEGDGIVQRQACASFNGNCCLWNTTVEVKACPGGYYVYRLTKPSVCFHV 
YCGHFYDICDEDCHGSCSDTSECTCAPGTVLGPDRQTCFDENECEQNNGGCSEICVNLKNSYRCECGVGRV 
LRSDGKTCEDVEGCHNNNGGCSHSCLGSEKGYQCECPRGLVLSEDNHTCQVPVLCKSNAIEVNIPRELVGG 
LELFLTNTSCRGVSNGTHVNILFSLKTCGTVVDVVNDKIVASNLVTGLPKQTPGSSGDFIIRTSKLLIPVT
CEFPRLYTISEGYVPNLRNSPLEIMSRNHGIFPFTLEIFKDNEFEEPYREALPTLKLRDSLYFGIEPVVHV
SGLESLVESCFATPTSKIDEVLKYYLIRDGCVSDDSVKQYTSRDHLAKHFQVPVFKFVGKDHKEVFLHCRV 
LVCGVLDERSRCAQGCHRRMRRGAGGEDSAGLQGQTLTGGPIRIDWED 

Important features of the protein: 
Signal peptide: 

amino acids 1-16 

N-glycosylation sites. 

amino acids 89-93, 116-120, 259-263, 291-295, 299-303 

Tyrosine kinase phosphorylation sites. 

amino acids 411-418, 443-451 

N-myristoylation sites.

amino acids 226-232, 233-239, 240-246, 252-258, 296-302, 300-306, 
522-528, 531-537 

Aspartic acid and asparagine hydroxylation site.

amino acids 197-209 

ZP domain proteins. 

amino acids 431-457 

Calcium-binding EGF-like proteins. 

amino acids 191-212, 232-253 
FIGURE 111



GAGAGAGGCAGCAGCTTGCTCAGCGGACAAGGATGCTGGGCGTGAGGGACCAAGGCCTGCCCTGCACTCGG
GCCTCCTCCAGCCAGTGCTGACCAGGGACTTCTGACCTGCTGGCCAGCCAGGACCTGTGTGGGGAGGCCCT 
CCTGCTGCCT T GGGGT GACAAT CT C AGC TCCAGGCT AC AGGGAGACCGGGAGGAT C AC AGAGCC AGCATGT 
TACAGGATCCTGACAGTGATCAACCTCTGAACAGCCTCGATGTCAAACCCCTGCGCAAACCCCGTATCCCC 
ATGGAGACCTTCAGAAAGGTGGGGATCCCCATCATCATAGCACTACTGAGCCTGGCGAGTATCATCATTGT 
GGTTGTCCTCATCAAGGTGATTCTGGATAAATACTACTTCCTCTGCGGGCAGCCTCTCCACTTCATCCCGA 
GGAAGCAGCT GTGT GACGGAGAGC TGGACT GTCCCT T GGGGG AGGACGAGGAGC ACTGTGTC AAGAGCT TC 
CCCGAAGGGCCTGCAGTGGCAGTCCGCCTCTCCAAGGACCGATCCACACTGCAGGTGCTGGACTCGGCCAC
AGGGAACTGGTTCTCTGCCTGTTTCGACAACTTCACAGAAGCTCTCGCTGAGACAGCCTGTAGGCAGATGG
GCTACAGCAGAGCTGTGGAGATTGGCCCAGACCAGGATCTGGATGTTGTTGAAATCACAGAAAACAGCCAG
GAGCTTCGCATGCGGAACTCAAGTGGGCCCTGTCTCTCAGGCTCCCTGGTCTCCCTGCACTGTCTTGCCTG
TGGGAAGAGCCTGAAGACCCCCCGTGTGGTGGGTGGGGAGGAGGCCTCTGTGGATTCTTGGCCTTGGCAGG
TCAGCATCCAGTACGACAAACAGCACGTCTGTGGAGGGAGCATCCTGGACCCCCACTGGGTCCTCACGGCA
GCCCACTGCTTCAGGAAACATACCGATGTGTTCAACTGGAAGGTGCGGGCAGGCTCAGACAAACTGGGCAG
CTTCCCATCCCTGGCTGTGGCCAAGATCATCATCATTGAATTCAACCCCATGTACCCCAAAGACAATGACA
TCGCCCTCATGAAGCTGCAGTTCCCACTCACTTTCTCAGGCACAGTCAGGCCCATCTGTCTGCCCTTCTTT
GATGAGGAGCTCACTCCAGCCACCCCACTCTGGATCATTGGATGGGGCTTTACGAAGCAGAATGGAGGGAA
GATGTCTGACATACTGCTGCAGGCGTCAGTCCAGGTCATTGACAGCACACGGTGCAATGCAGACGATGCGT
ACCAGGGGGAAGTCACCGAGAAGATGATGTGTGCAGGCATCCCGGAAGGGGGTGTGGACACCTGCCAGGGT
GACAGTGGTGGGCCCCTGATGTACCAATCTGACCAGTGGCATGTGGTGGGCATCGTTAGCTGGGGCTATGG
CTGCGGGGGCCCGAGCACCCCAGGAGTATACACCAAGGTCTCAGCCTATCTCAACTGGATCTACAATGTCT
GGAAGGCTGAGCTGTAATGCTGCTGCCCCTTTGCAGTGCTGGGAGCCGCTTCCTTCCTGCCCTGCCCACCT 
GGGG ATCC CC C AAAGT C AGACAC AGAGC AAGAGTCC CCTT GGGT AC AC CCCT CT GCCC AC AGCCTC AGCAT 
TTCTTGGAGCAGCAAAGGGCCTCAATTCCTGTAAGAGACCCTCGCAGCCCAGAGGCGCCCAGAGGAAGTCA 
GCAGCCCTAGCTCGGCCACACTTGGTGCTCCCAGCATCCCAGGGAGAGACACAGCCCACTGAACAAGGTCT 
CAGGGGTATT GCTAAGCC AAGAAGGAAC TT TCCC AC ACTACT GAAT GGAAGC AGGCTGTC TTGT AAAAGCC 
CAGATCACTGTGGGCTGGAGAGGAGAAGGAAAGGGTCTGCGCCAGCCCTGTCCGTCTTCACCCATCCCCAA 
GCCTACTAGAGCAAGAAACCAGTTGTAATATAAAATGCACTGCCCTACTGTTGGTATGACTACCGTTACCT 
ACTGT TGT C AT T GT TAT T ACAGCT AT GGCCACT ATT AT TAAAGAGC TGTGT AAC ATC T CT GGC AAAAAAAA 
AAAA 
FIGURE 112



MLQDPDSDQPLNSLDVKPLRKPRIPMETFRKVGIPIIIALLSLASIIIVVVLIKVILDKYYFLCG
QPLHFIPRKQLCDGELDCPLGEDEEHCVKSFPEGPAVAVRLSKDRSTLQVLDSATGNWFSACFDN 
FTEALAETACRQMGYSRAVEIGPDQDLDVVEITENSQELRMRNSSGPCLSGSLVSLHCLACGKSL
KTPRVVGGEEASVDSWPWQVSIQYDKQHVCGGSILDPHWVLTAAHCFRKHTDVFNWKVRAGSDKL
GSFPSLAVAKIIIIEFNPMYPKDNDIALMKLQFPLTFSGTVRPICLPFFDEELTPATPLWIIGWG 
FTKQNGGKMSDILLQASVQVIDSTRCNADDAYQGEVTEKMMCAGIPEGGVDTCQGDSGGPLMYQS 
DQWHVVGIVSWGYGCGGPSTPGVYTKVSAYLNWIYNVWKAEL

Transmembrane domain: 

amino acids 32-53 (type II)
FIGURE 113



GGCTGGACTGGAACTCCTGGTCCCAAGTGATCCACCCGCCTCAGCCTCCCAAGGTGCTGTGATTA
TAGGTGTAAGCCACCGTGTCTGGCCTCTGAACAACTTTTTCAGCAACTAAAAAAGCCACAGGAGT 
TGAACTGCTAGGATTCTGACTATGCTGTGGTGGCTAGTGCTCCTACTCCTACCTACATTAAAATC 
TGTTTTTTGTTCTCTTGTAACTAGCCTTTACCTTCCTAACACAGAGGATCTGTCACTGTGGCTCT 
GGCCCAAACCTGACCTTCACTCTGGAACGAGAACAGAGGTTTCTACCCACACCGTCCCCTCGAAG 
CCGGGGACAGCCTCACCTTGCTGGCCTCTCGCTGGAGCAGTGCCCTCACCAACTGTCTCACGTCT 
GGAGGCACTGACTCGGGCAGTGCAGGTAGCTGAGCCTCTTGGTAGCTGCGGCTTTCAAGGTGGGC 
CTTGCCCTGGCCGTAGAAGGGATTGACAAGCCCGAAGATTTCATAGGCGATGGCTCCCACTGCCC
AGGCATCAGCCTTGCTGTAGTCAATCACTGCCCTGGGGCCAGGACGGGCCGTGGACACCTGCTCA
GAAGCAGTGGGTGAGACATCACGCTGCCCGCCCATCTAACCTTTTCATGTCCTGCACATCACCTG
ATCCATGGGCTAATCTGAACTCTGTCCCAAGGAACCCAGAGCTTGAGTGAGCTGTGGCTCAGACC
CAGAAGGGGTCTGCTTAGACCACCTGGTTTATGTGACAGGACTTGCATTCTCCTGGAACATGAGG
GAACGCCGGAGGAAAGCAAAGTGGCAGGGAAGGAACTTGTGCCAAATTATGGGTCAGAAAAGATG
GAGGTGTTGGGTTATCACAAGGCATCGAGTCTCCTGCATTCAGTGGACATGTGGGGGAAGGGCTG
CCGATGGCGCATGACACACTCGGGACTCACCTCTGGGGCCATCAGACAGCCGTTTCCGCCCCGAT
CCACGTACCAGCTGCTGAAGGGCAACTGCAGGCCGATGCTCTCATCAGCCAGGCAGCAGCCAAAA 
TCTGCGATCACCAGCCAGGGGCAGCCGTCTGGGAAGGAGCAAGCAAAGTGACCATTTCTCCTCCC 
CTCCTTCCCTCTGAGAGGCCCTCCTATGTCCCTACTAAAGCCACCAGCAAGACATAGCTGACAGG 
GGCTAATGGCTCAGTGTTGGCCCAGGAGGTCAGCAAGGCCTGAGAGCTGATCAGAAGGGCCTGCT 
GTGCGAACACGGAAATGCCTCCAGTAAGCACAGGCTGCAAAATCCCCAGGCAAAGGACTGTGTGG 
CTCAATT T AAAT CAT G T T C T AGTAAT T GGAGC T G T C C C C AAGAC CAAAG GAGC T AGAGC T T GG TT 
CAAATGATCTCCAAGGGCCCTTATACCCCAGGAGACTTTGATTTGAATTTGAAACCCCAAATCCA 
AACCTAAGAACCAGGTGCATTAAGAATCAGTTATTGCCGGGTGTGGTGGCCTGTAATGCCAACAT 
T T T GGGAGGCCGAGGCGGGTAGAT CACC TGAGGT CAGGAGTT CAAGACCAGCC TGGCCAACAT GG 
TGAAACCCCTGTCTCTACTAAAAATACAAAAAAACTAGCCAGGCATGGTGGTGTGTGCCTGTATC 
CCAGC TACTCGGGAGGCTGAGACAGGAGAAT TACT TGAACCT GGGAGGTGAAGGAGGCTGAGACA 
GGAGAATCAC T TCAGCC TGAGCAACACAGCGAGAC TCTGT CT CAGAAAAAATAAAAAAAGAAT TA 
TGGTTATTTGTAA 



FIGURE 114



MLWWLVLLLLPTLKSVFCSLVTSLYLPNTEDLSLWLWPKPDLHSGTRTEVSTHTVPSKPGTASPC 
WPLAGAVPSPTVSRLEALTRAVQVAEPLGSCGFQGGPCPGRRRD 

Signal peptide: 

amino acids 1-15 
FIGURE 115

CAGCAGTGGTCTCTCAGTCCTCTCAAAGCAAGGAAAGAGTACTGTGTGCTGAGAGACCATCGC^ 
AGAATCCTCCAGAGAAT TGTGAAGACTGTCACAT TCTAAATGCAGAAGCTT TTAAATCCAAGAAA 
ATATGTAAATCACTTAAGATTTGTGGACTGGTGTTTGGTATCCTGGCCCTAACTCTAATTGTCCT 
G T T T T GG GGGAG C AAG CAC T T C T GGC C GGAG G T AC CCAAAAAAGC C TAT GACAT G GAG C AC AC TT 
TC TACAGCAAT GGAGAGAAGAAGAAGAT TTACAT GGAAAT T GATCC T GT GACCAGAACT GAAATA 
T T CAGAAGCGGAAAT GGCACTGAT GAAACATT GGAAGT GCACGACT T TAAAAACGGATACACTGG 
CATCTACTTCGTGGGTCTTCAAAAATGTTTTATCAAAACTCAGATTAAAGTGATTCCTGAATTTT
CTGAACCAGAAGAGGAAATAGATGAGAATGAAGAAATTACCACAACTTTCTTTGAACAGTCAGTG
ATTTGGGTCCCAGCAGAAAAGCCTATTGAAAACCGAGATTTTCTTAAAAATTCCAAAATTCTGGA
GATTTGTGATAACGTGACCATGTATTGGATCAATCCCACTCTAATATCAGTTTCTGAGTTACAAG
ACTTTGAGGAGGAGGGAGAAGATCTTCACTTTCCTGCCAACGAAAAAAAAGGGATTGAACAAAAT
GAACAGTGGGTGGTCCCTCAAGTGAAAGTAGAGAAGACCCGTCACGCCAGACAAGCAAGTGAGGA
AGAACTTCCAATAAATGACTATACTGAAAATGGAATAGAATTTGATCCCATGCTGGATGAGAGAG
GTTATTGTTGTATTTACTGCCGTCGAGGCAACCGCTATTGCCGCCGCGTCTGTGAACCTTTACTA
GGCTACTACCCATATCCATACTGCTACCAAGGAGGACGAGTCATCTGTCGTGTCATCATGCCTTG
TAACTGGTGGGTGGCCCGCATGCTGGGGAGGGTCTAATAGGAGGTTTGAGCTCAAATGCTTAAAC
TGCTGGCAACATATAATAAATGCATGCTATTCAATGAATTTCTGCCTATGAGGCATCTGGCCCCT
GGTAGCCAGCTCTCCAGAATTACTTGTAGGTAATTCCTCTCTTCATGTTCTAATAAACTTCTACA
TTATCACCAAAAAAAAAAAAAAAAAAA 
FIGURE 116



MAKNPPENCEDCHILNAEAFKSKKICKSLKICGLVFGILALTLIVLFWGSKHFWPEVPKKAYDME 
HTFYSNGEKKKIYMEIDPVTRTEIFRSGNGTDETLEVHDFKNGYTGIYFVGLQKCFIKTQIKVIP
EFSEPEEEIDENEEITTTFFEQSVIWVPAEKPIENRDFLKNSKILEICDNVTMYWINPTLISVSE 
LQDFEEEGEDLHFPANEKKGIEQNEQWVVPQVKVEKTRHARQASEEELPINDYTENGIEFDPMLD
ERGYCCIYCRRGNRYCRRVCEPLLGYYPYPYCYQGGRVICRVIMPCNWWVARMLGRV 

Important features of the protein: 
Signal peptide: 

amino acids 1-40 

Transmembrane domain: 

amino acids 25-47 (type II) 

N-glycosylation sites. 

amino acids 94-97, 180-183 

Glycosaminoglycan attachment sites. 

amino acids 92-95, 70-73, 85-88, 133-136, 148-151, 192-195, 239-242

N-myristoylation sites. 

amino acids 33-38, 95-100, 116-121, 215-220, 272-277 

Microbodies C-terminal targeting signal. 

amino acids 315-317 

Cytochrome c family heme-binding site signature. 

amino acids 9-14 
FIGURE 117



GAGCTCCCCTCAGGAGCGCGTTAGCTTCACACCTTCGGCAGCAGGAGGGCGGCAGCTTCTCGCAGGCGGCA 

GGGCGGGCGGCCAGGATCATGTCCACCACCACATGCCAAGTGGTGGCGTTCCTCCTGTCCATCCTGGGGCT 

GG CCGGCT GC AT CGCGGCC ACC GGGAT GGACATGTGGAGC ACCC AGGACCT GT ACGACAACCCCGT CACCT 

CCGT GT TC C AGT AC GAAGGGCT CT GG AGGAGCTGCGT GAGGC AGAGTT C AGGCT T C ACCG AATGCAGGCCC 

TATTTCACCATCCTGGGACTTCCAGCCATGCTGCAGGCAGTGCGAGCCCTGATGATCGTAGGCATCGTCCT 

GGGTGCCATTGGCCTCCTGGTATCCATCTTTGCCCTGAAATGCATCCGCATTGGCAGCATGGAGGACTCTG 

CCAAAGCCAACATGACACTGACCTCCGGGATCATGTTCATTGTCTCAGGTCTTTGTGCAATTGCTGGAGTG 

TCTGTGTTTGCCAACATGCTGGTGACTAACTTCTGGATGTCCACAGCTAACATGTACACCGGCATGGGTGG 

GATGGTGCAGACTGTTCAGACCAGGTACACATTTGGTGCGGCTCTGTTCGTGGGCTGGGTCGCTGGAGGCC 

TCACACTAATTGGGGGTGTGATGATGTGCATCGCCTGCCGGGGCCTGGCACCAGAAGAAACCAACTACAAA 

GCCGTTTCTTATCATGCCTCAGGCCACAGTGTTGCCTACAAGCCTGGAGGCTTCAAGGCCAGCACTGGCTT 

TGGGTCCAACACCAAAAACAAGAAGATATACGATGGAGGTGCCCGCACAGAGGACGAGGTACAATCTTATC 

CT TC C AAGCACGACT AT GTG TAAT GCTCTAAGACCT CTCAGC ACGGGC GGAAGAAACT CC CGGAGAGCT C A 

CCCAAAAAAC AAGGAGATCC C ATCT AGATT T C TT CTTGCT TT TGACTC AC AG CT GGAAGT T AGAAAAGCC T 

CGATTTCATCTTTGGAGAGGCCAAATGGTCTTAGCCTCAGTCTCTGTCTCTAAATATTCCACCATAAAACA 

GCTGAGTTATTTATGAATTAGAGGCTATAGCTCACATTTTCAATCCTCTATTTCTTTTTTTAAATATAACT 

T T CT ACT C TGATGAGAGAAT GT GGT T TT AATCTCT CT C TC AC AT T T T GAT GAT T T AGACAGAC T CC CCCT C 

TTCCTCCTAGTCAATAAACCCATTGATGATCTATTTCCCAGCTTATCCCCAAGAAAACTTTTGAAAGGAAA 

GAGTAGACCCAAAGATGTTATTTTCTGCTGTTTGAATTTTGTCTCCCCACCCCCAACTTGGCTAGTAATAA 

ACACTTACTGAAGAAGAAGCAATAAGAGAAAGATATTTGTAATCTCTCCAGCCCATGATCTCGGTTTTCTT 

AC ACTGT G AT CT T AAAAGTT ACCAAACC AAAGTCATT T TCAGTT TGAG GCAACCAAACCT TT CT AC TGCT G 

TTGACATCTTCTTATTACAGCAACACCATTCTAGGAGTTTCCTGAGCTCTCCACTGGAGTCCTCTTTCTGT 

CGCGGGT C AG AAATTGT CCCT AGATGAATGAGAAAATT AT TTTT TT T AATTT AAGT CCTAAAT AT AGT T AA 

AAT AAAT AAT GTTTTAGT AAAAT GAT ACACT ATC TCT GTG AAAT AGCCT C AC CCCT AC AT GT GGAT AGAAG 

GAAATGAAAAAATAATTGCTTTGACATTGTCTATATGGTACTTTGTAAAGTCATGCTTAAGTACAAATTCC 

AT GAAAAGCT C ACACCTGT AAT CCT AGC ACTT TGGGAGGCTGAG GAGG AAGG AT CACT T G AGCCCAGAAGT 

TCGAGACT AGCCTGGG CAACAT GG AG AAGCCC TGTCTCTAC AAAAT AC AGAGAG AAAAAAT C AGCC AGT C A 

TGGT GGCATACACCTGT AGT CCCAGC AT T CCGGGAGGCTG AGGT GGGAGG AT C ACT TG AGCC CAGGGAGG T 

TGGGGCT GC AGT GAGCCATG AT CACACC ACT G CACTCC AGCC AGGT GAC AT AGCGAGATCCT GT CT AAAAA 

AAT AAAAAAT AAAT AAT GGAAC AC AGC AAGT C CT AGGAAGT AGGTT AAAACT AAT T CT TT AA 
FIGURE 118



MSTTTCQVVAFLLSILGLAGCIAATGMDMWSTQDLYDNPVTSVFQYEGLWRSCVRQSSGFTECRP
YFTILGLPAMLQAVRALMIVGIVLGAIGLL^ 

CAIAGVSVFANMLVTNFWMSTANMYTGMGGMVQTVQTRYTFGAALFVGWVAGGLTLIGGVMMCIA
CRGLAPEETNYKAVSYHASGHSVAYKPGGFKASTGFGSNTKNKKIYDGGARTEDEVQSYPSKHDY 

V 

Signal peptide: 

amino acids 1-23 

Transmembrane domains: 

amino acids 81-100, 121-141, 173-194 
FIGURE 119



GGAAAAACTGTTCTCTTCTGTGGCACAGAGAACCCTGCTTCAAAGCAGAAGTAGCAGTTCCGGAGTCC 

AGCTGGCTAAAACTCATCCCAGAGGATAATGGCAACCCATGCCTTAGAAATCGCTGGGCTGTTTCTTG 

GTGGTGTTGGAATGGTGGGCACAGTGGCTGTCACTGTCATGCCTCAGTGGAGAGTGTCGGCCTTCATT 

GAAAACAACAT CGTGGTTTTT GAAAACT TCT GGGAAGGACT GT GGAT GAATTGCGT GAGGCAGGCTAA 

CATCAGGATGCAGTGCAAAATCTATGATTCCCTGCTGGCTCTTTCTCCGGACCTACAGGCAGCCAGAG 

GACTGATGTGTGCTGCTTCCGTGATGTCCTTCTTGGCTTTCATGATGGCCATCCTTGGCATGAAATGC 

ACCAGGTGCACGGGGGACAATGAGAAGGTGAAGGCTCACATTCTGCTGACGGCTGGAATCATCTTCAT 

CATCACGGGCATGGTGGTGCTCATCCCTGTGAGCTGGGTTGCCAATGCCATCATCAGAGATTTCTATA 

ACTCAATAGTGAATGTT GCCCAAAAAC GTGAGCTT GGAGAAGCT CTCTACT TAGGATGGACCACGGCA 

CTGGTGCTGATTGTTGGAGGAGCTCTGTTCTGCTGCGTTTTTTGTTGCAACGAAAAGAGCAGTAGCTA 

CAGATACTCGATACCTTCCCATCGCACAACCCAAAAAAGTTATCACACCGGAAAGAAGTCACCGAGCG 

T CTACTCCAGAAGTCAGTATGTGTAGTT GTGTATGTT TTT TTAACTT TACTATAAAGCCAT GCAAATG 

ACAAAAATCTATATTACTTTCTCAAAATGGACCCCAAAGAAACTTTGATTTACTGTTCTTAACTGCCT

AAT CT T AAT T ACAGGAACT GT GC AT CAG CTAT T TAT GAT T C TATAAG C TAT T T CAGC AGAATGAGAT A 

TTAAACCCAATGCTTTGATTGTTCTAGAAAGTATAGTAATTTGTTTTCTAAGGTGGTTCAAGCATCTA 

CTCT TTTTATCATTTAC TT CAAAAT GACATT GCTAAAGACT GCAT TATT TTACTACTGTAATT T CTCC 

ACGACATAGCATTATGTACATAGATGAGTGTAACATTTATATCTCACATAGAGACATGCTTATATGGT 

T TTATTTAAAAT GAAATGCCAGT CCATT ACACT GAATAAATAGAACT CAACTATT GCTTT T CAGGGAA 

ATCATGGATAGGGTTGAAGAAGGTTACTATTAATTGTTTAAAAACAGCTTAGGGATTAATGTCCTCCA 

T TTATAATGAAGATTAAAATGAAGGCT T TAATCAGCATT GTAAAGGAAATTGAATGGCTTT CTGATAT 

GCTGTTTTTTAGCCTAGGAGTTAGAAATCCTAACTTCTTTATCCTCTTCTCCCAGAGGCTTTTTTTTT 

CTT GT GT AT TAAAT T AACAT TTT T AAAAC GCAGAT AT TTT GT CAAGG G G C T T T GC AT TC AAAC T GCT T 

TTCCAGGGCTATACTCAGAAGAAAGATAAAAGTGTGATCTAAGAAAAAGTGATGGTTTTAGGAAAGTG 

AAAATATTTTTGTTTTTGTATTTGAAGAAGAATGATGCATTTTGACAAGAAATCATATATGTATGGAT 

ATATTTTAATAAGTAT TTGAGTACAGACTTTGAGGTT TCATCAATATAAATAAAAGAGCAGAAAAATA 

TGTCTTGGTTTTCATTTGCTTACCAAAAAAACAACAACAAAAAAAGTTGTCCTTTGAGAACTTCACCT 

GCTCCTATGTGGGTACCTGAGTCAAAATTGTCATTTTTGTTCTGTGAAAAATAAATTTCCTTCTTGTA 

CCATTTCTGTTTAGTTTTACTAAAATCTGTAAATACTGTATTTTTCTGTTTATTCCAAATTTGATGAA 

ACTGACAATCCAATTTGAAAGTTTGTGTCGACGTCTGTCTAGCTTAAATGAATGTGTTCTATTTGCTT 

TATACATT T ATAT TAATAAATTGTACAT T TT T CTAATT 
FIGURE 120

MATHALE I AGLFLGGVGMVGTVAVT WP 

IYDSLLALSPDLQAARGLMCAASVMSFLAFMMAILGMKCTRCTGDNEKVKAHILLTAGIIFIITG
MVVLIPVSWVANAIIRDFYNSIVNVAQKRELGEALYLGWTTALVLIVGGALFCCVFCCNEKSSSY
RYSIPSHRTTQKSYHTGKKSPSVYSRSQYV 



Signal peptide: 

amino acids 1-17 



Transmembrane domains: 



amino acids 82-101, 118-145, 164-188
FIGURE 121



GGAGAGAGGCGCGCGGGTGAAAGGCGCATTGATGCAGCCTGCGGCGGCCTCGGAGCGCGGCGGAG 
CCAGACGCTGACCACGTTCCTCTCCTCGGTCTCCTCCGCCTCCAGCTCCGCGCTGCCCGGCAGCC 
GGGAGCCATGCGACCCCAGGGCCCCGCCGCCTCCCCGCAGCGGCTCCGCGGCCTCCTGCTGCTCC 
TGCTGCTGCAGCTGCCCGCGCCGTCGAGCGCCTCTGAGATCCCCAAGGGGAAGCAAAAGGCGCAG 
C T CCGGCAGAGGGAGGT GGT GGACCT GTATAAT GGAAT GT GC TTACAAGGGCCAGCAGGAGT GCC 
TGGTCGAGACGGGAGCCCTGGGGCCAATGTTATTCCGGGTACACCTGGGATCCCAGGTCGGGATG 
GATTCAAAGGAGAAAAGGGGGAATGTCTGAGGGAAAGCTTTGAGGAGTCCTGGACACCCAACTAC
AAGCAGTG T TCAT GGAGTT CAT T GAAT TATGGCAT AGATC T T GGGAAAATT GCGGAGT GT ACAT T

TACAAAGATGCGTTCAAATAGTGCTCTAAGAGTTTTGTTCAGTGGCTCACTTCGGCTAAAATGCA
GAAATGCATGCTGTCAGCGTTGGTATTTCACATTCAATGGAGCTGAATGTTCAGGACCTCTTCCC
AT TGAAGC TATAAT T TATT T GGAC CAAGGAAGCCCTGAAAT GAAT TCAACAAT TAAT AT TCAT CG

CACTTCTTCTGTGGAAGGACTTTGTGAAGGAATTGGTGCTGGATTAGTGGATGTTGCTATCTGGG
TTGGCACTTGTTCAGATTACCCAAAAGGAGATGCTTCTACTGGATGGAATTCAGTTTCTCGCATC
AT TAT TGAAGAACTACCAAAATAAATGC TT TAAT T TT CATT TGC TACC T CT T TTT T TAT TAT GCC

TT GGAAT GGTTCACT TAAAT GACAT T T T AAAT AAG T T TAT G TATACAT C T GAAT GAAAAGC AAAG 
C TAAATAT GT T TACAGACCAAAGT GT GATT TCACACT GT TT T TAAATC TAGCAT TAT T CATT T TG

C T T C AAT C AAAAG T GG T T T C AAT AT T T T T T T T AGT T G G T T AG AAT AC T T T C T T CAT AG T C AC AT T

CTCTCAACCTATAATTTGGAATATTGTTGTGGTCTTTTGTTTTTTCTCTTAGTATAGCATTTTTA
AAAAAATATAAAAGCT ACCAATC T T TGTACAATT T GT AAAT GT TAAGAAT T T T T T T TATATCT GT

TAAAT AAAAAT TAT T T CCAACA 
FIGURE 122

MRPQGPAAS PQRLRGLLLLLLLQLPAPS SASE I PKGKQKAQLRQRE VVDLYNGMCLQGPAGVPGR 
DG S PGANVI PGT PG I PGRDG FKGEKGECLRE S FEE SWT PNYKQCS WS S LNYG IDLGKI AE CT FTK 
MRSNSALRVLFSGSLRLKCRNACCQRWYFTFNGAECSGPLPIEAIIYLDQGSPEMNSTINIHRTS 
SVEGLCEGIGAGLVDVAIWVGTCSDYPKGDASTGWNSVSRI I IEELPK 

Signal peptide: 

amino acids 1-30 

Transmembrane domain: 

amino acids 195-217 
FIGURE 123



GCTGAGCGTGTGCGCGGTACGGGGCTCTCCTGCCTTCTGGGCTCCAACGCAGCTCTGTGGCTGAA 
CTGGGTGCTCATCACGGGAACTGCTGGGCTATGGAATACAGATGTGGCAGCTCAGGTAGCCCCAA 
ATTGCCTGGAAGAATACATCATGTTTTTCGATAAGAAG7^AATTGTAGGATCCAGTTTTTTTTTTA 
ACCGCCCCCTCCCCACCCCCCAAAAAAACTGTAAAGATGCAAAAACGTAATATCCATGAAGATCC 
TATTACCTAGGAAGATTTTGATGTTTTGCTGCGAATGCGGTGTTGGGATTTATTTGTTCTTGGAG 
TGTTCTGCGTGGCTGGCAAAGAATAATGTTCCAAAATCGGTCCATCTCCCAAGGGGTCCAATTTT 
T C T T CC TGGGTGT CAGCGAGCCC T GAC TCAC T ACAGTGCAGC TGACAGGGGC T GT CAT GCAACTG 
GCCCC TAAGC CAAAGCAAAAGACC TAAGGACGACCT T T GAACAATACAAAGGATGGGT T T CAATG 
TAAT TAGGC T AC TGAGCGGATCAGCT GTAGCAC T GGT TATAGCCCCCACT GT C T TACT GACAAT G 
CTTTCTTCTGCCGAACGAGGATGCCCTAAGGGCTGTAGGTGTGAAGGCAAAATGGTATATTGTGA
ATCTCAGAAATTACAGGAGATACCCTCAAGTATATCTGCTGGTTGCTTAGGTTTGTCCCTTCGCT
ATAACAGCC T T CAAAAACT TAAGTATAATCAAT T T AAAGGGC TCAACCAGC TCACC T GGCT AT AC

C T TGACCATAACCAT AT CAGCAATAT TGACGAAAAT GC T T T TAATGGAATACGCAGACT CAAAGA

GC TGAT TC T TAGT TCCAAT AGAAT CT CC TAT T TT CT TAACAATACC TTCAGAC CT G T GACAAAT T

TACGGAACTTGGATCTGTCCTATAATCAGCTGCATTCTCTGGGATCTGAACAGTTTCGGGGCTTG
CGGAAGCTGCTGAGTTTACATTTACGGTCTAACTCCCTGAGAACCATCCCTGTGCGAATATTCCA
AGACTGCCGCAACCTGGAACTTTTGGACCTGGGATATAACCGGATCCGAAGTTTAGCCAGGAATG 
TCTTTGCT GGCAT GAT CAGAC T CAAAGAAC T T C ACC T GGAGCAC AAT CAATT T T C C AAGC T C AAC

CTGGCCCTTTTTCCAAGGTTGGTCAGCCTTCAGAACCTTTACTTGCAGTGGAATAAAATCAGTGT
CATAGGACAGACCATGTCCT GGACCTGGAGCTCCTTACAAAGGCT TGAT T TAT CAGGCAAT GAGA

TCGAAGCTTTCAGTGGACCCAGTGTTTTCCAGTGTGTCCCGAATCTGCAGCGCCTCAACCTGGAT 
TCCAACAAGCTCACATT TATT GGTCAAGAGATTTTGGAT TCTTGGATATCCCTCAAT GACATCAG 
TCTTGCTGGGAATATATGGGAATGCAGCAGAAATATTTGCTCCCTTGTAAACTGGCTGAAAAGTT

ttaaaggtctaagggagaatac^ttatctgtgccagtcccaaagagctgca^
atcgatgcagtgaagaactacagcatctgtggcaaaagtactacagagaggtttgatctggccag
ggctctcccaaagccgacgtttaagcccaagctccccaggccgaagcatgagagcaaaccccctt
tgcccccgacggtgggagccacagagcccggcccagagaccgatgctgacgccgagcacatctct 
ttccataaaatcatcgcgggcagcgtggcgcttttcctgtccgtgctcgtcatcctgctggttat
ctacgtgtcatggaagcggtaccctgcgagcatgaagcagctgcagcagcgctccctcatgcgaa
ggcacaggaaaaagaaaagacagtccctaaagcaaatgactcccagcacccaggaattttatgta 
gattataaacccaccaacacggagaccagcgagatgctgctgaatgggacgggaccctgcaccta 
taacaaatcgggctccagggagtgtgaggtatgaaccattgtgataaaaagagctcttaaaagct 
gggaaataagtggtgctttattgaactctggtgactatcaagggaacgcgatgccccccctcccc 
ttccctctccctctcactttggtggcaagatccttccttgtccgttttagtgcattcataatact 
ggtcattttcctctcatacataatcaacccattgaaatttaaataccacaatcaatgtgaagctt 
gaactccggtttaatataatacctattgtataagaccctttactgattccattaatgtcgcattt 
gttttaagataaaacttctttcataggtaaaaaaaaaaa 
FIGURE 124

MGFNVIRLLSGSAVALVIAPTVLLTMLSSAERGCPKGCRCEGKMVYCESQKLQEIPSSISAGCLG 
LSLRYNSLQKLKYNQFKGLNQLTWLYLDHNHISNIDENAFNGIRRLKELILSSNRISYFLNNTFR 
PVTNLRNLDLSYNQLHSLGSEQFRGLRKLLS LHLRSNSLRT I PVRI FQDCRNLELLDLGYNRIRS 
LARNVFAGMIRLKELHLEHNQFSKLNLALFPRLVSLQNLYLQWNKISVIGQTMSW
SGNEIEAFSGPSVFQCVPNLQRLNLDSNKLTFIGQEILDSWISLNDISLAGNIWECSRNICSLVN 
WLKS FKGLRENT I I CAS PKELQGVNVI DAVKNYS I CGKST TERFDLARALPKPT FKPKLPRPKHE 
SKPPLPPTVGATEPGPETDADAEHISFHKIIAGSVALFLSVLVILLVIYVSWKRYPASMKQLQQR 
SLMRRHRKKKRQSLKQMTPSTQEFYVDYKPTNTETSEMLLNGTGPCTYNKSGSRECEV 

Important features of the protein: 
Signal peptide: 

amino acids 1-33 

Transmembrane domain: 

amino acids 420-442 

N-glycosylation sites. 

amino acids 126-129, 357-360, 496-499, 504-507 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 465-468

Tyrosine kinase phosphorylation site. 

amino acids 136-142 

N-myristoylation sites. 

amino acids 11-16, 33-38, 245-250, 332-337, 497-502, 507-512 
FIGURE 125

CCGTTATCGTCTTGCGCTACTGCTGA ATG TCCGTCCCGGAGGAGGAGGAGAGGCTTTTGCCGCTG 
ACCCAGAGATGGCCCCGAGCGAGCAAATTCCTACTGTCCGGCTGCGCGGCTACCGTGGCCGAGCT 
AGCAACCTTTCCCCTGGATCTCACAAAAACTCGACTCCAAATGCAAGGAGAAGCAGCTCTTGCTC 
GGTTGGGAGACGGTGCAAGAGAATCTGCCCCCTATAGGGGAATGGTGCGCACAGCCCTAGGGATC 
AT TGAAGAGGAAGGCT T TCTAAAGCT T TGGCAAGGAGT GACACCCGCCAT TTACAGACAC GTAGT 
GTATTCTGGAGGTCGAATGGTCACATATGAACATCTCCGAGAGGTTGTGTTTGGCAAAAGTGAAG 
ATGAGCATTATCCCCTTTGGAAATCAGTCATTGGAGGGATGATGGCTGGTGTTATTGGCCAGTTT 
T TAGC CAATCCAAC TGACCT AGTGAAGGT TCAGAT GCAAAT GGAAGGAAAAAGGAAACTGGAAGG 
AAAACCATTGCGATTTCGTGGTGTACATCATGCATTTGCAAAAATCTTAGCTGAAGGAGGAATAC 
GAGGGCTTTGGGCAGGCTGGGTACCCAATATACAAAGAGCAGCACTGGTGAATATGGGAGATTTA 
ACCACTTATGATACAGTGAAACAC TACTTGGTAT TGAATACACCACTTGAGGACAATATCATGAC 
TCACGGTTTATCAAGTTTATGTTCTGGACTGGTAGCTTCTATTCTGGGAACACCAGCCGATGTCA 
TCAAAAGCAGAATAATGAAT CAACCACGAGATAAACAAGGAAGGGGACTT TTGTATAAAT CATCG 
AC TGAC TGC T TGAT TCAGGC TGT T CAAGGTGAAGGAT TCAT GAGT CTATATAAAGGC TTT T T ACC 
ATCTTGGCTGAGAATGACCCCTTGGTCAATGGTGTTCTGGCTTACTTATGAAAAAATCAGAGAGA 
TGAGTGGAGTCAGTCCATTT TAA 
FIGURE 126

MSVPEEEERLLPLTQRWPRASKFLLSGCAATVAELATFPLDLTKTRLQMQGEAALARLGDGARES
APYRGMVRTALGIIEEEGFLKLWQGVTPAIYRHVVYSGGRMVTYEHLREVVFGKSEDEHYPLWKS
VIGGMMAGVIGQFLANPTDLVK^

NIQRAALVNMGDLTTYDTVKHYLVLNTPLEDNIMTHGLSSLCSGLVASILGTPADVIKSRIMNQP 
RDKQGRGLLYKSSTDCLIQAVQGEGFMSLYKGFLPSWLRMTPWSMVFWLTYEKIREMSGVSPF 

Transmembrane domains:

amino acids 25-38, 130-147, 233-248
FIGURE 127

CGCGGATCGGACCCT^AGCAGGTCGGCGGCGGCGGCAGGAGAGCGGCCGGGCGTCAGCTCCTCGAC 
CCCCGTGTCGGGCTAGTCCAGCGAGGCGGACGGGCGGCGTGGGCCCATGGCCAGGCCCGGCATGG 
AGCGGTGGCGCGACCGGCTGGCGCTGGTGACGGGGGCCTCGGGGGGCATCGGCGCGGCCGTGGCC 
CGGGCCCTGGTCCAGCAGGGACTGAAGGTGGTGGGCTGCGCCCGCACTGTGGGCAACATCGAGGA 
GCTGGCTGCTGAATGTAAGAGTGCAGGCTACCCCGGGACTTTGATCCCCTACAGATGTGACCTAT 
CAAAT GAAGAGGACAT CCT CT CCATGT TC TCAGC TAT C CGT TC TCAGCACAGCGGT G TAGACATC 
TGCATCAACAATGCTGGCTTGGCCCGGCCTGACACCCTGCTCTCAGGCAGCACCAGTGGTTGGAA 
GGACATGTTCAATGTGAACGTGCTGGCCCTCAGCATCTGCACACGGGAAGCCTACCAGTCCATGA 
AGGAGCGGAATGTGGACGATGGGCACATCATTAACATCAATAGCATGTCTGGCCACCGAGTGTTA 
CCCCTGTCTGTGACCCACTTCTATAGTGCCACCAAGTATGCCGTCACTGCGCTGACAGAGGGACT 
GAGGCAAGAGCTTCGGGAGGCCCAGACCCACATCCGAGCCACGTGCATCTCTCCAGGTGTGGTGG 
AGACACAAT TCGCCTTCAAACTCCACGACAAGGAC CC T GAGAAGGCAGC TGCCACCTATGAGCAA 
ATGAAGTGTCTCAAACCCGAGGATGTGGCCGAGGCTGTTATCTACGTCCTCAGCACCCCCGCACA 
CATCCAGAT TGGAGACATCCAGATGAGGCCCACGGAGCAGGTGACCTAGTGAC T GT GGGAGCT CC 
TCCTTCCCTCCCCACCCTTCATGGCTTGCCTCCTGCCTCTGGATTTTAGGTGTTGATTTCTGGAT 
CACGGGATACCACTTCCTGTCCACACCCCGACCAGGGGCTAGAAAATTTGTTTGAGATTTTTATA 
TCATCTTGTCAAATTGCTTCAGTTGTAAATGTGAAAAATGGGCTGGGGAAAGGAGGTGGTGTCCC 
TAATTGTTTTACTTGTTAACTTGTTCTTGTGCCCCTGGGCACTTGGCCTTTGTCTGCTCTCAGTG 
TCTTCCCTTTGACATGGGAAAGGAGTTGTGGCCAAAATCCCCATCTTCTTGCACCTCAACGTCTG 
TGGCTCAGGGCTGGGGTGGCAGAGGGAGGCCTTCACCTTATATCTGTGTTGTTATCCAGGGCTCC 
AGACTTCCTCCTCTGCCTGCCCCACTGCACCCTCTCCCCCTTATCTATCTCCTTCTCGGCTCCCC 
AGCCCAGTCTTGGCTTCTTGTCCCCTCCTGGGGTCATCCCTCCACTCTGACTCTGACTATGGCAG 
CAGAACACCAGGGCCTGGCCCAGTGGATTTCATGGTGATCATTAAAAAAGAAAAATCGCAACCAA 
AAAAAAAAAA 






FIGURE 128 



MARPGMERWRDRLALVTGASGGIGAAVARALVQQGLKVVGCARTVGNIEELAAECKSAGYPGTLI
PYRCDLSNEEDILSMFSAIRSQHSGVDICINNAGLARPDTLLSGSTSGWKDMFNVNVLALSICTR
EAYQSMKERNVDDGHIININSMSGHRVLPLSVTHFYSATKYAVTALTEGLRQELREAQTHIRATC
ISPGVVETQFAFKLHDKDPEKAAATYEQMKCLKPEDVAEAVIYVLSTPAHIQIGDIQMRPTEQVT

Important features of the protein: 
Signal peptide: 

amino acids 1-17 

N-myristoylation sites. 

amino acids 18-24, 21-27, 22-28, 24-30, 40-46, 90-96, 109-115, 199-205

Short-chain alcohol dehydrogenase.

amino acids 30-42, 104-114 
FIGURE 129

AACTTCTAC ATGG GCCTCCTGCTGCTGGTGCTCTTCCTCAGCCTCCTGCCGGTGGCCTACACCAT 
CATGTCCCTCCCACCCTCCTTTGACTGCGGGCCGTTCAGGTGCAGAGTCTCAGTTGCCCGGGAGC 
ACCTCCCCTCCCGAGGCAGTCTGCTCAGAGGGCCTCGGCCCAGAATTCCAGTTCTGGTTTCATGC 
CAGCCTGTAAAAGGCCATGGAACTTTGGGTGAATCACCGATGCCATTTAAGAGGGTTTTCTGCCA 
GGATGGAAATGTTAGGTCGTTCTGTGTCTGCGCTGTTCATTTCAGTAGCCACCAGCCACCTGTGG 
CCGTTGAGTGCTTGAAATGAGGAACTGAGAAAATTAATTTCTCATGTATTTTTCTCATTTATTTA 
TTAATT TT TAACTGATAGT TGTACATAT TTGGGGGTACATGT GATATTTGGATACATGTATACAA 
TATATAAT GAT CAAATCAGGGTAACT GGGATAT CCAT CACAT CAAACAT T TAT T T T T TAT TC T TT 
T TAGACAGAGT CT CACTCT GT CACCCAGGCTGGAGTGCAGT GGT GCCAT CT CAGCT TAC T GCAAC 
CTCTGCCTGCCAGGTTCAAGCGATTCTCATGCCTCCACCTCCCAAGTAGCTGGGACTACAGGCAT 
GCACCACAATGCCCAACTAATTTTTGTATTTTTAGTAGAGACGGGGTTTTGCCATGTTGCCCAGG 
CTGGCCTTGAACTCCTGGCCTCAAACAATCCACTTGCCTCGGCCTCCCAAAGTGTTATGATTACA 
GGCGTGAGCCACCGTGCCTGGCCTAAACATTTATCTTTTCTTTGTGTTGGGAACTTTGAAATTAT 
ACAATGAATTATTGTTAACTGTCATCTCCCTGCTGTGCTATGGAACACTGGGACTTCTTCCCTCT 
ATCTAACTGTATATTTGTACCAGTTAACCAACCGTACTTCATCCCCACTCCTCTCTATCCTTCCC 
AACCTCTGATCACCTCATTCTACTCTCTACCTCCATGAGATCCACTTTTTTAGCTCCCACATGTG 
AGTAAGAAAATGCAATATTTGTCTTTCTGTGCCTGGCTTATTTCACTTAACATAATGACTTCCTG 
T T C CAT C CAT G T T GC T GC AAAT G ACAG GAT TTCGTTCT T AAT T T C AAT T AAAAT AAC C AC AC AT G 
GCAAAAA 
FIGURE 130

MGLLLLVLFLSLLPVAYTIMSLPPSFDCGPFRCRVSVAREHLPSRGSLLRGPRPRIPVLVSCQPV 
KGHGTLGES PMPFKRVFCQDGNVRS FCVCAVHFS SHQPPVAVECLK 

Important features of the protein: 
Signal peptide: 

amino acids 1-18 

N-myristoylation site. 

amino acids 86-92

Zinc carboxypeptidases, zinc-binding region 2 signature.

amino acids 68-79 
FIGURE 131



TT CT GAAGT AACGG AAGCTACC TTGT AT AAAG ACCTCAAC ACTGCT GACCATGATC AGCGCAGCCT GGAGC 
ATCTTCCTCATCGGGACTAAAATTGGGCTGTTCCTTCAAGTAGCACCTCTATCAGTTATGGCTAAATCCTG 
TCCATCTGTGTGT C GC TG CGAT GC GGGT TTC ATT T ACT GT AATGAT C GCT TT CTGACATCCATT CC AACAG 
GAAT ACCAGAGG AT GC TACAAC TCTCTACCT T CAGAAC AACCAAAT AAAT AATGCT GGGAT T CCT TCAGAT 
TT GAAAAACT TGCT GAAAGT AGAAAGAAT AT ACCT AT ACC ACAACAGTTT AG ATGAAT T TCCT ACC AACCT 
CCCAAAGT AT GT AAAAGAGT T ACAT TTGCAAGAAAAT AAC AT AAGGACT ATC ACTT AT GAT T CACTTT CAA 
AAATTCCCTATCTGGAAGAATTACATTTAGATGACAACTCTGTCTCTGCAGTTAGCATAGAAGAGGGAGCA 
TTCCGAGACAGCAACTATCTCCGACTGCTTTTCCTGTCCCGTAATCACCTTAGCACAATTCCCTGGGGTTT
GCCCAGGACTATAGAAGAACTACGCTTGGATGATAATCGCATATCCACTATTTCATCACCATCTCTTCAAG
GTCT CACT AGT CT AAAAC GCCT GGT T CT AGAT GGAAAC CT GTTGAACAAT CATGGT TT AGGT GACAAAGTT 
TT CT TCAACCT AGT T AAT TT GACAGAGC TGT C CCTGGT GC GGAATT CCCT GACT GCTGC ACCAGT AAACCT

T C C AGGCAC AAACCTG AGGAAGCTT T AT CTT C AAGAT AAC C ACATC AATCGGGTGCCCCCAAAT GCTT TT T

CTTATCTAAGGCAGCTCTATCGACTGGATATGTCCAATAATAACCTAAGTAATTTACCTCAGGGTATCTTT
GATG AT TT GG AC AAT AT AACACAACT GATTCT TCGCAACAAT CCCT GGTATT GCGGGT GC AAGATGAAAT G



GGTACGTGACTGGTTACAATCACTACCTGTGAAGGTCAACGTGCGTGGGCTCATGTGCCAAGCCCCAGAAA
AGGTTCGT GGGAT GGCT ATT AAGG AT CT C AAT GCAGAACT GTTT GATTGT AAGGACAGTGGGATTGTAAGC 






ACCATT CAGAT AAC C ACT GC AAT ACC C AAC AC AGTGT ATC CTGCCC AAGG AC AGTGGCC AGC TC CAGTGAC 
CAAACAGCCAGATATTAAGAACCCCAAGCTCACTAAGGATCAACAAACCACAGGGAGTCCCTCAAGAAAAA 
C AAT TACAAT TACT GT GAAGTCTGT C AC CTCT GAT ACC AT TC AT AT CTCT TGGAAACT TGCTCT ACC T AT G 
ACT GCT TTGAGACT C AGC TG GC TT AAAC TGGGCC AT AGCC CGGCAT TTGG AT CT AT AAC AGAAACAATT GT 
AACAGG GG AACGCAGT GAGT AC TTGG TC AC AGCCCTGG AGCCTGAT TC ACCC T AT AAAGT AT GCATGGTT C 
CC AT GG AAACCAGC AACC T CT ACCT ATT TGAT GAAACT CC TGTT TGT AT T G AGACT GAAACT GCACCC CT T 
CGAATGTACAACCCTACAACCACCCTCAATCGAGAGCAAGAGAAAGAACCTTACAAAAACCCCAATTTACC 
TTTGGCTGCCATCATTGGTGGGGCTGTGGCCCTGGTTACCATTGCCCTTCTTGCTTTAGTGTGTTGGTATG 
TTCATAGGAATGGATCGCTCTTCTCAAGGAACTGTGCATATAGCAAAGGGAGGAGAAGAAAGGATGACTAT 
GCAGAAGCTGGCACT AAGAAGGAC AACTCTATCCTGGAAATCAGGGAAACTTCTTTTCAGAT GT T ACC AAT 
AAGC AAT G AACCCATCTCGAAGGAGG AGTTT GTAAT AC AC ACC AT AT TTC CT CCT AATGGAATGAATC TGT 
ACAAAAACAATCACAGTGAAAGCAGTAGTAACCGAAGCTACAGAGACAGTGGTATTCCAGACTCAGATCAC 
TCACACTCATGATGCTGAAGGACTCACAGCAGACTTGTGTTTTGGGTTTTTTAAACCTAAGGGAGGTGATG 
GT 
FIGURE 132






MISAAWSIFLIGTKIGLFLQVAPLSVMAKSCPSVCRCDAGFIYCNDRFLTSIPTGIPEDATTLYL 
QNNQINNAGIPSDLKNLLKVERIYLYHNSLDEFPTNLPKYVKELHLQENNIRTITYDSLSKIPYL 
EELHLDDNS VSAVS IEEGAFRDSNYLRLLFLSRNHLST I PWGLPRT IEELRLDDNRI ST I S SPSL 
QGLTSLKRLVLDGNLLNNHGLGDKVFFNLVNLTELSLVRNSLTAAPVNLPGTNLRKLYLQDNHIN 
RVPPNAFSYLRQLYRLDMSNNNLSNLPQGIFDDLDNITQLILRNNPWYCGCKMKWVRDWLQSLPV
KVNVRGLMCQAPEKVRGMAIKDLNAELFDCKDSGIVST I QITTAI PNTVYPAQGQWPAPVTKQPD 
IKNPKLTKDQQTTGSPSRKT I T I TVKSVTSDT IHI SWKLALPMTALRLSWLKLGHS PAFGS I TET 
IVTGERSEYLVTALEPDSPYKVCMVPMETSNLYLFDETPVCIETETAPLRMYNPTTTLNREQEKE 
PYKNPNLPLAAI IGGAVALVT IALLALVCWYVHRNGSLFSRNCAYSKGRRRKDDYAEAGTKKDNS 
ILEIRETSFQMLPISNEPISKEEFVIHTIFPPNGMNLYKNNHSESSSNRSYRDSGIPDSDHSHS 



Important features of the protein:

Signal peptide:

amino acids 1-28

Transmembrane domain:

amino acids 531-552

N-glycosylation sites.

amino acids 226-229, 282-285, 296-299, 555-558, 626-629, 633-636

Tyrosine kinase phosphorylation site.

amino acids 515-522

N-myristoylation sites.

amino acids 12-17, 172-177, 208-213, 359-364, 534-539, 556-561, 640-645

Amidation site. 

amino acids 567-570 
Leucine zipper pattern. 

amino acids 159-180 

Phospholipase A2 aspartic acid active site. 

amino acids 34-44 
FIGURE 133



CCGTCATCCCCCTGCAGCCACCCTTCCCAGAGTCCTTTGCCCAGGCCACCCCAGGCTTCTTGGCA 
GCCCTGCCGGGCCACTTGTCTTCATGTCTGCCAGGGGGAGGTGGGAAGGAGGTGGGAGGAGGGCG 
TGCAGAGGCAGT C T GGGCT T GGCCAGAGCT CAGGGTGC TGAGCGTGTGACCAGCAGT GAGCAGAG 
GCCGGCCATGGCCAGCCTGGGGCTGCTGCTCCTGCTCTTACTGACAGCACTGCCACCGCTGTGGT 
CCTCCTCACTGCCTGGGCTGGACACTGCTGAAAGTAAAGCCACCATTGCAGACCTGATCCTGTCT 
GCGCTGGAGAGAGCCACCGTCTTCCTAGAACAGAGGCTGCCTGAAATCAACCTGGATGGCATGGT 
GGGGGTCCGAGTGCTGGAAGAGCAGCTAAAAAGTGTCCGGGAGAAGTGGGCCCAGGAGCCCCTGC
TGCAGCCGCTGAGCCTGCGCGTGGGGATGCTGGGGGAGAAGCTGGAGGCTGCCATCCAGAGATCC
CTCCACTACCTCAAGCTGAGTGATCCCAAGTACCTAAGAGAGTTCCAGCTGACCCTCCAGCCCGG
GTTTTGGAAGCTCCCACATGCCTGGATCCACACTGATGCCTCCTTGGTGTACCCCACGTTCGGGC
CCCAGGACTCATTCTCAGAGGAGAGAAGTGACGTGTGCCTGGTGCAGCTGCTGGGAACCGGGACG
GACAGCAGCGAGCCCTGCGGCCTCTCAGACCTCTGCAGGAGCCTCATGACCAAGCCCGGCTGCTC 
AGGCTACTGCCTGTCCCACCAACTGCTCTTCTTCCTCTGGGCCAGAATGAGGGGATGCACACAGG 
GACCAC TCCAACAGAGCCAGGAC TATAT CAACCT CT TC TGCGCCAACAT GAT GGAC T TGAACC GC

AGAGC T GAGGCCATCGGAT ACGCCTACCCTACCCGGGACAT CT TCATGGAAAACAT CAT GT TCTG

TGGAATGGGCGGCTTCTCCGACTTCTACAAGCTCCGGTGGCTGGAGGCCATTCTCAGCTGGCAGA
AACAGCAGGAAGGATGCT T C GGGGAGCC TGAT GC T GAAGAT GAAGAAT TATC TAAAGCTAT T CAA

TATCAGCAGCAT T T T TCGAGGAGAGTGAAGAGGCGAGAAAAACAAT T TC CAGATT C TCGCTCTGT

TGCTCAGGCTGGAGTACAGTGGCGCAATCTCGGCTCACTGCAACCTTTGCCTCCTGGGTTCAAGC
AATTCTCTTGCCTCATCCTCCCGAGTAGCTGGGACTACAGGAGCGTGCCACCATACCTGGCTAAT 
TTTTATATTTTTTTAGTAGAGACAGGGTTTCATCATGTTGCTCATGCTGGTCTCGAACTCCTGAT 
CTCAAGAGATCCGCCCACCTCAGGCTCCCAAAGTGTGGGATTATAGGTGTGAGCCACCGTGTCTG 
GCTGAAAAGCACTTTCAAAGAGACTGTGTTGAATAAAGGGCCAAGGTTCTTGCCACCCAGCACTC 
ATGGGGGCTCTCTCCCCTAGATGGCTGCTCCTCCCACAACACAGCCACAGCAGTGGCAGCCCTGG 
GTGGCTTCCTATACATCCTGGCAGAATACCCCCCAGCAAACAGAGAGCCACACCCATCCACACCG 
CCACCACCAAGCAGCCGCTGAGACGGACGGTTCCATGCCAGCTGCCTGGAGGAGGAACAGACCCC 
T T TAGT CC TCAT C CCT TAGAT CC TGGAGGGCACGGAT CACAT CCTGGGAAGAAGGCATC TGGAGG 
ATAAGCAAAGCCACCCCGACACCCAATCTTGGAAGCCCTGAGTAGGCAGGGCCAGGGTAGGTGGG 
GGCCGGGAGGGACCCAGGTGTGAACGGATGAATAAAGTTCAACTGCAACTGAAAAAAAAAAA
FIGURE 134



MSARGRWEGGGRRACRGSLGLARAQGAERVTSSEQRPAMASLGLLLLLLLTALPPLWSSSLPGLD 
TAESKAT I ADLI LSALERATVFLEQRLPE INLDGMVGVRVLEEQLKS VREKWAQE PLLQPLSLRV 
GMLGEKLEAAIQRSLHYLKLSDPKYLREFQLTLQPGFWKLPHAWIHTDASLVYPTFGPQDSFSEE 
RSDVCLVQLLGTGTDSSEPCGLSDLCRSLMTKPGCSGYCLSHQLLFFLWARMRGCTQGPLQQSQD 
YINLFCANMMDLNRRAEAIGYAYPTRDIFMENIMFCGMGGFSDFYKLRWLEAILSWQKQQEGCFG 
EPDAEDEELSKAIQYQQHFSRRVKRREKQFPDSRSVAQAGVQWRNLGSLQPLPPGFKQFSCLILP 
S SWDYRS VP PYLANFY I FLVE TGFHHVAHAGLELL I SRDPPT S GS QSVGL 

Important features of the protein: 
Signal peptide: 

amino acids 1-26

Transmembrane domain:

amino acids 39-56

Tyrosine kinase phosphorylation sites.

amino acids 149-156, 274-282

N-myristoylation sites.

amino acids 10-16, 20-26, 63-69, 208-214

Amidation site. 

amino acids 10-14 

Glycoprotein hormones beta chain signature 1. 

amino acids 230-237 
FIGURE 135

GGTCTGAGTGCAGAGCTGCTGTCATGGCGGCCGCTCTGTGGGGCTTCTTTCCCGTCCTGCTGCTG 
CTGCTGCTATCGGGGGATGTCCAGAGCTCGGAGGTGCCCGGGGCTGCTGCTGAGGGATCGGGAGG 
GAGTGGGGTCGGCATAGGAGATCGCTTCAAGATTGAGGGGCGTGCAGTTGTTCCAGGGGTGAAGC 
CTCAGGACTGGATCTCGGCGGCCCGAGTGCTGGTAGACGGAGAAGAGCACGTCGGTTTCCTTAAG 
ACAGATGGGAGTTTTGTGGTTCATGATATACCTTCTGGATCTTATGTAGTGGAAGTTGTATCTCC 
AGC T TACAGAT T TGATC CCGT TCGAGTGGATATCACT T CGAAAGGAAAAATGAGAGCAAGATAT G 
TGAAT TACATCAAAACATCAGAGGT TGT CAGACT GCCCT AT CC TC TCCAAATGAAATCT T CAGG T 
CCACCTTCTTACTTTATTAAAAGGGAATCGTGGGGCTGGACAGACTTTCTAATGAACCCAATGGT
TATGATGATGGTTCTTCCTTTATTGATATTTGTGCTTCTGCCTAAAGTGGTCAACACAAGTGATC
C T GACATGAGACGGGAAATGGAGCAGTCAATGAATAT GC TGAATTCCAACC ATGAGT TGCCT GAT

GTT TCTGAGT TCAT GACAAGACTCT TCTCTTCAAAATCATCTGGCAAATCTAGCAGCGGCAGCAG

TAAAACAGGCAAAAGT GGGGCTGGCAAAAGGAGG TAGT CAGGCCGTCCAGAGC TGGCAT T TGCAC 
AAACACGGCAACAC TGGGT GGCAT CCAAGTC T TGGAAAACCGT GTGAAGCAACTACT AT AAACT T

GAGT C AT CCCGACGT T GAT CT C T T AC AAC T GT GT AT GT T

AAC TT T T TAGCACATGT T T T GTAC T TGGTACACGAGAAAACCCAGC T T TCATC T T T T GT CT GT AT

GAGGTCAATAT T GAT GT CAC T GAAT TAAT TACAGTGTCC T ATAGAAAATGCCAT TAATAAAT TAT

ATGAACTACTATACATTATGTATAT TAAT TAAAACATCTTAATCCAGAAATCAAAAAAAAAAAAA

AAAAAAAAAAAAAAA
FIGURE 136



MAAALWGFFPVLLLLLLSGDVQSSEVPGAAAEGSGGSGVGIGDRFKIEGRAVVPGVKPQDWISAA
RVLVDGEEHVGFLKTDGSFVVHDIPSGSYVVEVV

VVRLPYPLQMKSSGPPSYFIKRESWGWTDFLMNPMVMMMVLPLLIFVLLPKVVNTSDPDMRREME
QSMNMLNSNHELPDVSEFMTRLFSSKSSGKSSSGSSKTGKSGAGKRR






Important features of the protein: 
Signal sequence: 

amino acids 1-23 

Transmembrane domain: 

amino acids 161-182 

N-glycosylation site. 



amino acids 184-187

Glycosaminoglycan attachment sites.

amino acids 37-40, 236-239

cAMP- and cGMP-dependent protein kinase phosphorylation site.

amino acids 151-154 



N-myristoylation sites. 

amino acids 33-38, 36-41, 38-44, 229-234 



Amidation site. 

amino acids 238-241 



ATP/GTP-binding site motif A (P-loop).

amino acids 229-236 



FIGURE 137



GATGGCGCAGCCACAGCTTCTGTGAGATTCGATTTCTCCCCAGTTCCCCTGTGGGTCTGAGGGGA 
CCAGAAGGGTGAGCTACGTTGGCTTTCTGGAAGGGGAGGCTATATGCGTCAATTCCCCAAAACAA 
GTTTTGACATTTCCCCTGAAATGTCATTCTCTATCTATTCACTGCAAGTGCCTGCTGTTCCAGGC 
CT TACCTGCTGGGCACTAACGGCGGAGCCAGGAT GGGGACAGAATAAAGGAGCCACGACCTGTGC 
CACCAACTCGCACTCAGACTCTGAACTCAGACCTGAAATCTTCTCTTCACGGGAGGCTTGGCAGT 
TTTTCTTACTCCTGTGGTCTCCAGATTTCAGGCCTAAGATGAAAGCCTCTAGTCTTGCCTTCAGC 
CTTCTCTCTGCTGCGTTTTATCTCCTATGGACTCCTTCCACTGGACTGAAGACACTCAATTTGGG
AAGC TGTGT GATCGCCACAAACC T T CAGGAAATAC GAAAT GGATT TTCT GAGAT ACGGGGCAGT G

TGCAAGCCAAAGATGGAAACAT TGACATCAGAATCT TAAGGAGGAC TGAGT C T T TGCAAGACACA 
AAGCCTGCGAATCGATGCTGCCTCCTGCGCCATTTGCTAAGACTCTATCTGGACAGGGTATTTAA 

AAACTACCAGACCCCTGACCATTATACTCTCCGGAAGATCAGCAGCCTCGCCAATTCCTTTCTTA

CCATCAAGAAGGACCTCCGGCTCTCTCATGCCCACATGACATGCCATTGTGGGGAGGAAGCAATG

AAGAAATACAGCCAGATTCTGAGTCACTTTGAAAAGCT GGAACCTCAGGCAGCAGT TGTGAAGGC

TTTGGGGGAACTAGACATTCTTCTGCAATGGATGGAGGAGACAGA ATAG GAGGAAAGTGATGCTG
CTGC TAAGAAT AT T CGAGGT CAAGAGC TCCAGTC TT CAATACC TGCAGAGGAGGCATGACCCCAA

ACCACCATCTCTTTACTGTACTAGTCTTGTGCTGGTCACAGTGTATCTTATTTATGCATTACTTG
CTTCCTTGCATGATTGTCTTTATGCATCCCCAATCTTAATTGAGACCATACTTGTATAAGATTTT
TGTAATATCT T T CT GC TATT GGATATAT T TAT TAGT TAATATAT T TAT T TATT T T T T GCT AT T TA 
ATGTATTTATTTTT TTACTT GGACATGAAACT TTAAAAAAAT TCACAGATTATATTTATAACCTG

ACT AGAGCAGGT GATGTATT T T TATACAG TAAAAAAAAAAAAC CT T GTAAAT TC TAGAAGAGTGG 
CTAGGGGGG T TAT T CAT T TGTAT TCAAC TAAGGACATAT T T AC TCAT GCT GATGCTC TGT GAGAT 
AT T T GAAAT T GAAC CAAT GAC TACT TAGGATGGGT T GT GGAATAAGTT T T GATG TGGAAT T GCAC 
ATCTACCTTACAATTACTGACCATCCCCAGTAGACTCCCCAGTCCCATAATTGTGTATCTTCCAG 
CCAGGAATCC TACACGGCCAGCATGTAT T TC T ACAAATAAAG T T T T CT T T GCATACCAAAAAAAA 
AAAAAAAAAAA 
FIGURE 138

MRQFPKTSFDISPEMS FSIYSLQVPAVPGLTCWALTAEPGWGQNKGATTCATNSHSDSELRPEIF 
SSREAWQFFLLLWSPDFRPKMKASSLAFSLLSAAFYLLWTPSTGLKTLNLGSCVIATNLQEIRNG 
FSEIRGSVQAKDGNIDIRILRRTESLQDTKPANRCCLLRHLLRLYLDRVFKNYQTPDHYTLRKIS 
SLANSFLTIKKDLRLSHAHMTCHCGEEAMKKYSQILSHFEKLEPQAAVVKALGELDILLQWMEET
E 

Important features of the protein: 
Signal peptide: 
amino acids 1-42 

cAMP- and cGMP-dependent protein kinase phosphorylation sites.

amino acids 192-195, 225-228 

N-myristoylation sites.

amino acids 42-47, 46-51, 136-141 
FIGURE 139



CCTGGAGCCGGAAGCGCGGCTGCAGCAGGGCGAGGCTCCAGGTGGGGTCGGTTCCGCATCCAGCC 
TAGCGTGTCCACGATGCGGCTGGGCTCCGGGACTTTCGCTACCTGTTGCGTAGCGATCGAGGTGC 
TAGGGATCGCGGTCTTCCTTCGGGGATTCTTCCCGGCTCCCGTTCGTTCCTCTGCCAGAGCGGAA 
CACGGAGCGGAGCCCCCAGCGCCCGAACCCTCGGCTGGAGCCAGTTCTAACTGGACCACGCTGCC 
ACCACCTCTCTTCAGTAAAGTTGTTATTGTTCTGATAGATGCCTTGAGAGATGATTTTGTGTTTG 
GGTCAAAGGGT GT GAAATT TATGCCC T ACACAAC T TACCT T GTGGAAAAAGGAGCAT CTCACAGT 
TTTGTGGCT GAAGCAAAGCCACC TACAGT TAC TAT GCCT CGAATCAAGGCAT TGAT GACGGGGAG 
CCTTCCTGGCTTTGTCGACGTCATCAGGAACCTCAATTCTCCTGCACTGCTGGAAGACAGTGTGA 
TAAGACAAGCAAAAGCAGCTGGAAAAAGAATAGT C T T T TATGGAGATGAAACC TGGGTT AAAT TA 
T TCCCAAAGCAT T T TGT GGAATAT GATGGAACAACCTCAT T T T TCGTGTCAGAT T ACACAGAGGT 
GGATAATAATGT CACGAGGCAT T T GGAT AAAGTAT TAAAAAGAGGAGAT TGGGACATAT TAAT CC

TCCACTACCTGGGGCTGGACCACATTGGCCACATTTCAGGGCCCAACAGCCCCCTGATTGGGCAG
AAGCTGAGCGAGATGGACAGCGTGCTGATGAAGATCCACACCTCACTGCAGTCGAAGGAGAGAGA
GACGCCTTTACCCAATTTGCTGGTTCTTTGTGGTGACCATGGCATGTCTGAAACAGGAAGTCACG
GGGCC T CCTCCACC GAGGAGGTGAATACACC TCT GAT T T TAATCAG TTCTGCGTTT GAAAGGAAA

CCCGGTGATATCCGACATCCAAAGCACGTCCA ATAGA CGGATGTGGCTGCGACACTGGCGATAGC
ACTTGGCTTACCGATTCCAAAAGACAGTGTAGGGAGCCTCCTATTCCCAGTTGTGGAAGGAAGAC
CAATGAGAGAGCAGT T GAGAT T T T TACAT T T GAAT ACAGTGCAGC T TAGTAAAC TGT TGCAAGAG

AATGTGCCGTCATAT GAAAAAGATCCT GGGT T T GAGCAGT T TAAAATGTCAGAAAGATT GCAT GG

GAAC TGGATCAGACT GTACT T GGAGGAAAAGCAT T CAGAAGT CCTATT CAACC T GGGCT CCAAGG 
TTCTCAGGCAGTACCTGGATGCTCTGAAGACGCTGAGCTTGTCCCTGAGTGCACAAGTGGCCCAG
TTCTCACCCTGCTCCTGCTCAGCGTCCCACAGGCACTGCACAGAAAGGCTGAGCTGGAAGTCCCA
CTGTCATCTCCTGGGTTTTCTCTGCTCTTTTATTTGGTGATCCTGGTTCTTTCGGCCGTTCACGT
CATTGTGTGCACCTCAGCTGAAAGTTCGTGCTACTTCTGTGGCCTCTCGTGGCTGGCGGCAGGCT
GCCTTTCGTTTACCAGACTCTGGTTGAACACCTGGTGTGTGCCAAGTGCTGGCAGTGCCCTGGAC
AGGGGGCCTCAGGGAAGGACGTGGAGCAGCCTTATCCCAGGCCTCTGGGTGTCCCGACACAGGTG
TTCACATCTGTGCTGTCAGGTCAGATGCCTCAGTTCTTGGAAAGCTAGGTTCCTGCGACTGTTAC
CAAGGT GAT T GT AAAGAGCT GGCGGTCACAGAGGAACAAGCC CCCCAGC T GAGGGGG T G T GT GAA

TCGGACAGCCT CCCAGCAGAGGT GTGGGAGC T GCAGCT GAGGGAAGAAGAGACAAT CGGCC T GGA 
CACTCAGGAGGGTCAAAAGGAGACTTGGTCGCACCACTCATCCTGCCACCCCCAGAATGCATCCT 
GCCTCATCAGGTCCAGATTTCTTTCCAAGGCGGACGTTTTCTGTTGGAATTCTTAGTCCTTGGCC 
TCGGACACCTTCATTCGTTAGCTGGGGAGTGGTGGTGAGGCAGTGAAGAAGAGGCGGATGGTCAC 
ACTCAGATCCACAGAGCCCAGGATCAAGGGACCCACTGCAGTGGCAGCAGGACTGTTGGGCCCCC 
ACCCCAACCCTGCACAGCCCTCATCCCCTCTTGGCTTGAGCCGTCAGAGGCCCTGTGCTGAGTGT 
CT GACCGAGACAC T CACAGC TT TGTCAT CAGGGCACAGGCT T CCT C GGAGCCAGGAT GAT C T GT G 
CCACGCTTGCACCTCGGGCCCATCTGGGCTCATGCTCTCTCTCCTGCTATTGAATTAGTACCTAG 
C TGCACACAG TAT GTAGT TACCAAAAGAATAAACGGCAATAAT TGAGAAAAAAAA 
FIGURE 140

MRLGSGTFATCCVAIEVLGIAVFLRGFFPAPVRSSARAEHGAEPPAPEPSAGASSNWTTLPPPLF
SKVVIVLIDALRDDFVFGSKGVKFMPYTTYLVEKGASHSFVAEAKPPTVTMPRIKALMTGSLPGF
VDVIRNLNSPALLEDSVIRQAKAAGKRIVFYGDETWVKLFPKHFVEYDGTTSFFVSDYTEVDNNV 
TRHLDKVLKRGDWD I L I LHYLGLDH I GH I SGPNS PLI GQKLSEMDS VLMKIHT S LQSKERET PLP 
NLLVLCGDHGMSETGSHGASSTEEVNTPLILISSAFERKPGDIRHPKHVQ 

Important features of the protein: 
Signal peptide: 
amino acids 1-34 

Transmembrane domain: 

amino acids 58-76 

N-glycosylation sites. 

amino acids 56-60, 194-198 

N-myristoylation sites. 

amino acids 6-12, 52-58, 100-106, 125-131, 233-239, 270-276, 
275-281, 278-284 

Amidation site. 

amino acids 154-158 

Cell attachment sequence. 

amino acids 205-208 
FIGURE 141



GGCACGAGGCAAGCCT T CCAGGT TATCG TGACGCACC T T GAAAGTC TGAGAGC T AC TGCCC TACA 
GAAAGTTACTAGTGCCCTAAAGCTGGCGCTGGCACTGATGTTACTGCTGCTGTTGGAGTACAACT 
TCCCTATAGAAAACAACTGCCAGCACC TTAAGACCACTCACACCTTCAGAGTGAAGAACT TAAAC 
CCGAAGAAATTCAGCATTCATGACCAGGATCACAAAGTACTGGTCCTGGACTCTGGGAATCTCAT 
AGCAGT TCCAGATAAAAACTACATACGCCCAGAGATCTT CT T TGCATTAGCCTCATCCTTGAGCT 
CAGCCTCTGCGGAGAAAGGAAGTCCGATTCTCCTGGGGGTCTCTAAAGGGGAGTTTTGTCTCTAC 
TGTGACAAGGATAAAGGACAAAGTCAT CCATCCCT TCAGCT GAAGAAGGAGAAACT GAT GAAGCT 
GGCTGCCCAAAAGGAATCAGCACGCCGGCCCTTCATCTTTTATAGGGCTCAGGTGGGCTCCTGGA
ACATGCTGGAGTCGGCGGCTCACCCCGGATGGTTCATCTGCACCTCCTGCAATTGTAATGAGCCT
GT TGGGGTGACAGATAAAT T T GAGAACAGGAAACACAT T GAAT TT TCAT T TCAACCAGT T TGCAA

AGCTGAAATGAGCCCCAGTGAGGTCAGCGATTAGGAAACTGCCCCATTGAACGCCTTCCTCGCTA
ATTTGAACTAATTGTATAAAAACACCAAACCTGCTCACT 
FIGURE 142

MLLLLLEYNFPIENNCQHLKTTHTFRVKNLNPKKFSIHDQDHKVLVLDSGNLIAVPDKNYIRPEI 
FFALAS S LS SASAEKGS P I LLGVSKGE FCL YCDKDKGQSHPS LQLKKEKLMKLAAQKESARRPFI 
FYRAQVGSWNMLESAAHPGWFICTSCNCNEPVGVTDKFENRKHIEFSFQPVCKAEMSPSEVSD 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 33-36 



N-myristoylation site.

amino acids 50-55, 87-92

Interleukin-1

amino acids 37-182
FIGURE 143

CTAGAGAGTATAGGGCAGAAGGATGGCAGATGAGTGACTCCACATCCAGAGCTGCCTCCCTTTAA 
TCCAGGATCCTGTCCTTCCTGTCCTGTAGGAGTGCCTGTTGCCAGTGTGGGGTGAGACAAGTTTG 
TCCCACAGGGCTGTCTGAGCAGATAAGATTAAGGGCTGGGTCTGTGCTCAATTAACTCCTGTGGG 
CACGGGGGCTGGGAAGAGCAAAGTCAGCGGTGCCTACAGTCAGCACCATGCTGGGCCTGCCGTGG 
AAGGGAGGTCTGTCCTGGGCGCTGCTGCTGCTTCTCTTAGGCTCCCAGATCCTGCTGATCTATGC 
CTGGCATTTCCACGAGCAAAGGGACTGTGATGAACACAATGTCATGGCTCGTTACCTCCCTGCCA 
CAGTGGAGTTTGCTGTCCACACATTCAACCAACAGAGCAAGGACTACTATGCCTACAGACTGGGG
CACATCTTGAATTCCTGGAAGGAGCAGGTGGAGTCCAAGACTGTATTCTCAATGGAGCTACTGCT
GGGGAGAACTAGGTGTGGGAAATTTGAAGACGACATTGACAACTGCCATTTCCAAGAAAGCACAG
AGCTGAACAATACTTTCACCTGCTTCTTCACCATCAGCACCAGGCCCTGGATGACTCAGTTCAGC
CTCCTGAACAAGACCTGCTTGGAGGGATTCCACTGAGTGAAACCCACTCACAGGCTTGTCCATGT
GCTGCTCCCACATTCCGTGGACATCAGCACTACTCTCCTGAGGACTCTTCAGTGGCTGAGCAGCT
TTGGACTTGTTTGTTATCCTATTTTGCATGTGTTTGAGATCTCAGATCAGTGTTTTAGAAAATCC
ACACATCTTGAGCCTAATCATGTAGTGTAGATCATTAAACATCAGCATTTTAAGAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
FIGURE 144

MLGLPWKGGLSWALLLLLLGSQILLIYAWHFHEQRDCDEHNVMARYLPATVEFAVHTFNQQSKDY
YAYRLGHILNSWKEQVESKTVFSMELLLGRTRCGKFEDDIDNCHFQESTELNNTFTCFFTISTRP
WMTQFSLLNKTCLEGFH



Important features of the protein: 
Signal peptide: 

amino acids 1-25

N-glycosylation sites.

amino acids 117-121, 139-143

N-myristoylation site.

amino acids 9-15
FIGURE 145

CTGTGCAGCTCGAGGCTCCAGAGGCACACTCCAGAGAGAGCCAAGGTTCTGACGCG ATGA GGAAG 
CACCTGAGCTGGTGGTGGCTGGCCACTGTCTGCATGCTGCTCTTCAGCCACCTCTCTGCGGTCCA 
GACGAGGGGCATCAAGCACAGAATCAAGTGGAACCGGAAGGCCCTGCCCAGCACTGCCCAGATCA 
CTGAGGCCCAGGTGGCTGAGAACCGCCCGGGAGCCTTCATCAAGCAAGGCCGCAAGCTCGACATT 
GACTTCGGAGCCGAGGGCAACAGGTACTACGAGGCCAACTACTGGCAGTTCCCCGATGGCATCCA 
CTACAACGGCTGCTCTGAGGCTAATGTGACCAAGGAGGCATTTGTCACCGGCTGCATCAATGCCA 
CCCAGGCGGCGAACCAGGGGGAGTTCCAGAAGCCAGACAACAAGCTCCACCAGCAGGTGCTCTGG 
CGGCTGGTCCAGGAGCTCTGCTCCCTCAAGCATTGCGAGTTTTGGTTGGAGAGGGGCGCAGGACT 
TCGGGTCACCATGCACCAGCCAGTGCTCCTCTGCCTTCTGGCTTTGATCTGGCTCATGGTGAAAT 
AAGCTTGCCAGGAGGCTGGCAGTACAGAGCGCAGCAGCGAGCAAATCCTGGCAAGTGACCCAGCT
CTTCTCCCCCAAACCCACGCGTGTTCTGAAGGTGCCCAGGAGCGGCGATGCACTCGCACTGCAAA 
TGCCGCTCCCACGTATGCGCCCTGGTATGTGCCTGCGTTCTGATAGATGGGGGACTGTGGCTTCT 
CCGTCACTCCATTCTCAGCCCCTAGCAGAGCGTCTGGCACACTAGATTAGTAGTAAATGCTTGAT
GAGAAGAACACATCAGGCACTGCGCCACCTGCTTCACAGTACTTCCCAACAACTCTTAGAGGTAG
GTGTATTCCCGTTTTACAGATAAGGAAACTGAGGCCCAGAGAGCTGAAGTACTGCACCCAGCATC
ACCAGCTAGAAAGTGGCAGAGCCAGGATTCAACCCTGGCTTGTCTAACCCCAGGTTTTCTGCTCT 
GTCCAATTCCAGAGCTGTCTGGTGATCACTTTATGTCTCACAGGGACCCACATCCAAACATGTAT
CTCTAATGAAATTGTGAAAGCTCCATGTTTAGAAATAAATGAAAACACCTGA
FIGURE 146



MRKHLSWWWLATVCMLLFSHLSAVQTRGIKHRIKWNRKALPSTAQITEAQVAENRPGAFIKQGRK
LDIDFGAEGNRYYEANYWQFPDGIHYNGCSEANVTKEAFVTGCINATQAANQGEFQKPDNKLHQQ
VLWRLVQELCSLKHCEFWLERGAGLRVTMHQPVLLCLLALIWLMVK

Important features of the protein: 
Signal peptide: 

amino acids 1-26 

Transmembrane domain: 

amino acids 157-171 




N-glycosylation sites.

amino acids 98-102, 110-114

Tyrosine kinase phosphorylation site.

amino acids 76-83

N-myristoylation sites.

amino acids 71-77, 88-94, 93-99, 107-113, 154-160

Amidation site.

amino acids 62-66
FIGURE 147

GCCTTGGCCTCCCAAAGGGCTGGGATTATAGGCGTGACCACCATGTCTGGTCCAGAGTCTCATTT 
CCTGATGATTTATAGACTCAAAGAAAACTCATGTTCAGAAGCTCTCTTCTCTTCTGGCCTCCTCT 
CTGTCTTCTTTCCCTCTTTCTTCTTATTTTAATTAGTAGCATCTACTCAGAGTCATGCAAGCTGG 
AAATCTTTCATTTTGCTTGTCAGTGGGGTAGGTCACTGAGTCTTAGTTTTTATTTTTTGAAATTT 
CAACTTTCAGATTCAGGGGGTACATGTGAAGGTTTGTTTTATGAGTATATTGCATGATGCTGAGG
TTTGGGGT 
FIGURE 148



MFRSSLLFWPPLCLLSLFLLILISSIYSESCKLEIFHFACQWGRSLSLSFYFLKFQLSDSGGTCE 
GLFYEYIA 

Important features of the protein: 
Signal peptide: 

amino acids 1-25 



N-myristoylation site.

amino acids 62-68
FIGURE 149



GTCTCCGCGTCACAGGAACTTCAGCACCCACAGGGCGGACAGCGCTCCCCTCTACCTGGAGACTTGAC 
TCCCGCGCGCCCCAACCCTGCTTATCCCTTGACCGTCGAGTGTCAGAGATCCTGCAGCCGCCCAGTCC 
CGGCCCCTCTCCCGCCCCACACCCACCCTCCTGGCTCTTCCTGTTTTTACTCCTCCTTTTCATTCATA 
ACAAAAGCTACAGCTCCAGGAGCCCAGCGCCGGGCTGTGACCCAAGCCGAGCGTGGAAGAATGGGGTT 
CCTCGGGACCGGCACTTGGATTCTGGTGTTAGTGCTCCCGATTCAAGCTTTCCCCAAACCTGGAGGAA 
GCCAAGACAAATCTCTACATAATAGAGAATTAAGTGCAGAAAGACCTTTGAATGAACAGATTGCTGAA 
GCAGAAGAAGACAAGATTAAAAAAACATATCCTCCAGAAAACAAGCCAGGTCAGAGCAACTATTCTTT
TGTTGATAACTTGAACCTGCTAAAGGCAATAACAGAAAAGGAAAAAATTGAGAAAGAAAGACAATCTA
TAAGAAGCTCCCCACTTGATAATAAGTTGAATGTGGAAGATGTTGATTCAACCAAGAATCGAAAACTG
ATCGATGATTATGACTCTACTAAGAGTGGATTGGATCATAAATTTCAAGATGATCCAGATGGTCTTCA
TCAACTAGACGGGACTCCTTTAACCGCTGAAGACATTGTCCATAAAATCGCTGCCAGGATTTATGAAG
AAAATGACAGAGCCGTGTTTGACAAGATTGTTTCTAAACTACTTAATCTCGGCCTTATCACAGAAAGC
CAAGCACATACACTGGAAGATGAAGTAGCAGAGGTTTTACAAAAATTAATCTCAAAGGAAGCCAACAA
TTATGAGGAGGATCCCAATAAGCCCACAAGCTGGACTGAGAATCAGGCTGGAAAAATACCAGAGAAAG
TGACTCCAATGGCAGCAATTCAAGATGGTCTTGCTAAGGGAGAAAACGATGAAACAGTATCTAACACA
TTAACCTTGACAAATGGCTTGGAAAGGAGAACTAAAACCTACAGTGAAGACAACTTTGAGGAACTCCA
ATATTTCCCAAATTTCTATGCGCTACTGAAAAGTATTGATTCAGAAAAAGAAGCAAAAGAGAAAGAAA
CACTGATTACTATCATGAAAACACTGATTGACTTTGTGAAGATGATGGTGAAATATGGAACAATATCT
CCAGAAGAAGGTGTTTCCTACCTTGAAAACTTGGATGAAATGATTGCTCTTCAGACCAAAAACAAGCT
AGAAAAAAATGCTACTGACAATATAAGCAAGCTTTTCCCAGCACCATCAGAGAAGAGTCATGAAGAAA
CAGACAGTACCAAGGAAGAAGCAGCTAAGATGGAAAAGGAATATGGAAGCTTGAAGGATTCCACAAAA
GATGATAACTCCAACCCAGGAGGAAAGACAGATGAACCCAAAGGAAAAACAGAAGCCTATTTGGAAGC 
CATCAGAAAAAATATTGAATGGTTGAAGAAACATGACAAAAAGGGAAATAAAGAAGATTATGACCTTT
CAAAGATGAGAGACTTCATCAATAAACAAGCTGATGCTTATGTGGAGAAAGGCATCCTTGACAAGGAA
GAAGCCGAGGCCATCAAGCGCATTTATAGCAGCCTGTAAAAATGGCAAAAGATCCAGGAGTCTTTCAA 
CTGTTTCAGAAAACATAATATAGCTTAAAACACTTCTAATTCTGTGATTAAAATTTTTTGACCCAAGG
GTTATTAGAAAGTGCTGAATTTACAGTAGTTAACCTTTTACAAGTGGTTAAAACATAGCTTTCTTCCC
GTAAAAACTATCTGAAAGTAAAGTTGTATGTAAGCTGAAAAAAAAAAAAAAAAAAAA
FIGURE 150



MGFLGTGTWILVLVLPIQAFPKPGGSQDKSLHNRELSAERPLNEQIAEAEEDKIKKTYPPENKPG 
QSNYSFVDNLNLLKAITEKEKIEKERQSIRSSPLDNKLNVEDVDSTKNRKLIDDYDSTKSGLDHK 
FQDDPDGLHQLDGTPLTAEDIVHKIAARIYEENDRAVFDKIVSKLLNLGLITESQAHTLEDEVAE 
VLQKLISKEANNYEEDPNKPTSWTENQAGKIPEKVTPMAAIQDGLAKGENDETVSNTLTLTNGLE
RRTKTYSEDNFEELQYFPNFYALLKSIDSEKEAKEKETLITIMKTLIDFVKMMVKYGTISPEEGV 
SYLENLDEMIALQTKNKLEKNATDNISKLFPAPSEKSHEETDSTKEEAAKMEKEYGSLKDSTKDD 
NSNPGGKTDEPKGKTEAYLEAIRKNIEWLKKHDKKGNKEDYDLSKMRDFINKQADAYVEKGILDK 
EEAEAIKRIYSSL 

N-glycosylation sites: 

amino acids 68-71, 346-349, 350-353 

Casein kinase II phosphorylation site: 

amino acids 70-73, 82-85, 97-100, 125-128, 147-150, 188-191, 217-220,
265-268, 289-292, 305-308, 320-323, 326-329, 362-365, 368-341,
369-372, 382-385, 386-389, 387-390



N-myristoylation sites:

amino acids 143-148, 239-244
FIGURE 151



CGGCTCGAGGCTCCCGCCAGGAGAAAGGAACATTCTGAGGGGAGTCTACACCCTGTGGAGCTCAA
GATGGTCCTGAGTGGGGCGCTGTGCTTCCGAATGAAGGACTCGGCATTGAAGGTGCTTTATCTGC 
ATAATAACCAGCTTCTAGCTGGAGGGCTGCATGCAGGGAAGGTCATTAAAGGTGAAGAGATCAGC
GTGGTCCCCAATCGGTGGCTGGATGCCAGCCTGTCCCCCGTCATCCTGGGTGTCCAGGGTGGAAG 
CCAGTGCCTGTCATGTGGGGTGGGGCAGGAGCCGACTCTAACACTAGAGCCAGTGAACATCATGG 
AGCTCTATCTTGGTGCCAAGGAATCCAAGAGCTTCACCTTCTACCGGCGGGACATGGGGCTCACC 
TCCAGCTTCGAGTCGGCTGCCTACCCGGGCTGGTTCCTGTGCACGGTGCCTGAAGCCGATCAGCC 
TGTCAGACTCACCCAGCTTCCCGAGAATGGTGGCTGGAATGCCCCCATCACAGACTTCTACTTCC 
AGCAGTGTGACTAGGGCAACGTGCCCCCCAGAACTCCCTGGGCAGAGCCAGCTCGGGTGAGGGGT 
GAGTGGAGGAGACCCATGGCGGACAATCACTCTCTCTGCTCTCAGGACCCCCACGTCTGACTTAG
TGGGCACCTGACCACTTTGTCTTCTGGTTCCCAGTTTGGATAAATTCTGAGATTTGGAGCTCAGT
CCACGGTCCTCCCCCACTGGATGGTGCTACTGCTGTGGAACCTTGTAAAAACCATGTGGGGTAAA
CTGGGAATAACATGAAAAGATTTCTGTGGGGGTGGGGTGGGGGAGTGGTGGGAATCATTCCTGCT
TAATGGTAACTGACAAGTGTTACCCTGAGCCCCGCAGGCCAACCCATCCCCAGTTGAGCCTTATA
GGGTCAGTAGCTCTCCACATGAAGTCCTGTCACTCACCACTGTGCAGGAGAGGGAGGTGGTCATA
GAGTCAGGGATCTATGGCCCTTGGCCCAGCCCCACCCCCTTCCCTTTAATCCTGCCACTGTCATA
TGCTACCTTTCCTATCTCTTCCCTCATCATCTTGTTGTGGGCATGAGGAGGTGGTGATGTCAGAA
GAAATGGCTCGAGCTCAGAAGATAAAAGATAAGTAGGGTATGCTGATCCTCTTTTAAAAACCCAA
GATACAATCAAAATCCCAGATGCTGGTCTCTATTCCCATGAAAAAGTGCTCATGACATATTGAGA
AGACCTACTTACAAAGTGGCATATATTGCAATTTATTTTAATTAAAAGATACCTATTTATATATT
TCTTTATAGAAAAAAGTCTGGAAGAGTTTACTTCAATTGTAGCAATGTCAGGGTGGTGGCAGTAT
AGGTGATTTTTCTTTTAATTCTGTTAATTTATCTGTATTTCCTAATTTTTCTACAATGAAGATGA
ATTCCTTGTATAAAAATAAGAAAAGAAATTAATCTTGAGGTAAGCAGAGCAGACATCATCTCTGA
TTGTCCTCAGCCTCCACTTCCCCAGAGTAAATTCAAATTGAATCGAGCTCTGCTGCTCTGGTTGG
TTGTAGTAGTGATCAGGAAACAGATCTCAGCAAAGCCACTGAGGAGGAGGCTGTGCTGAGTTTGT
GTGGCTGGAATCTCTGGGTAAGGAACTTAAAGAACAAAAATCATCTGGTAATTCTTTCCTAGAAG
GATCACAGCCCCTGGGATTCCAAGGCATTGGATCCAGTCTCTAAGAAGGCTGCTGTACTGGTTGA
ATTGTGTCCCCCTCAAATTCACATCCTTCTTGGAATCTCAGTCTGTGAGTTTATTTGGAGATAAG
GTCTCTGCAGATGTAGTTAGTTAAGACAAGGTCATGCTGGATGAAGGTAGACCTAAATTCAATAT
GACTGGTTTCCTTGTATGAAAAGGAGAGGACACAGAGACAGAGGAGACGCGGGGAAGACTATGTA
AAGATGAAGGCAGAGATCGGAGTTTTGCAGCCACAAGCTAAGAAACACCAAGGATTGTGGCAACC
ATCAGAAGCTTGGAAGAGGCAAAGAAGAATTCTTCCCTAGAGGCTTTAGAGGGATAACGGCTCTG
CTGAAACCTTAATCTCAGACTTCCAGCCTCCTGAACGAAGAAAGAATAAATTTCGGCTGTTTTAA 
GCCACCAAGGATAATTGGTTACAGCAGCTCTAGGAAACTAATACAGCTGCTAAAATGATCCCTGT 
CTCCTCGTGTTTACATTCTGTGTGTGTCCCCTCCCACAATGTACCAAAGTTGTCTTTGTGACCAA 
TAGAATATGGCAGAAGTGATGGCATGCCACTTCCAAGATTAGGTTATAAAAGACACTGCAGCTTC
TACTTGAGCCCTCTCTCTCTGCCACCCACCGCCCCCAATCTATCTTGGCTCACTCGCTCTGGGGG 
AAGCTAGCTGCCATGCTATGAGCAGGCCTATAAAGAGACTTACGTGGTAAAAAATGAAGTCTCCT 
GCCCACAGCCACATTAGTGAACCTAGAAGCAGAGACTCTGTGAGATAATCGATGTTTGTTGTTTT 
AAGTTGCTCAGTTTTGGTCTAACTTGTTATGCAGCAATAGATAAATAATATGCAGAGAAAGAG
FIGURE 152

MVLSGALCFRMKDSALKVLYLHNNQLLAGGLHAGKVIKGEEISVVPNRWLDASLSPVILGVQGGS
QCLSCGVGQEPTLTLEPVNIMELYLGAKESKSFTFYRRDMGLTSSFESAAYPGWFLCTVPEADQP 
VRLTQLPENGGWNAPITDFYFQQCD 



N-myristoylation sites.

amino acids 29-34, 30-35, 60-65, 63-68, 73-78, 91-96, 106-111

Interleukin-1 signature.

amino acids 111-131

Interleukin-1 proteins.

amino acids 8-29, 83-120, 95-134, 64-103
FIGURE 153



CTTCAGAACAGGTTCTCCTTCCCCAGTCACCAGTTGCTCGAGTTAGAATTGTCTGCA ATG GCCGC 
CCTGCAGAAATCTGTGAGCTCTTTCCTTATGGGGACCCTGGCCACCAGCTGCCTCCTTCTCTTGG 
CCCTCTTGGTACAGGGAGGAGCAGCTGCGCCCATCAGCTCCCACTGCAGGCTTGACAAGTCCAAC 
TTCCAGCAGCCCTATATCACCAACCGCACCTTCATGCTGGCTAAGGAGGCTAGCTTGGCTGATAA
CAACACAGACGTTCGTCTCATTGGGGAGAAACTGTTCCACGGAGTCAGTATGAGTGAGCGCTGCT 
ATCTGATGAAGCAGGTGCTGAACTTCACCCTTGAAGAAGTGCTGTTCCCTCAATCTGATAGGTTC 
CAGCCTTATATGCAGGAGGTGGTGCCCTTCCTGGCCAGGCTCAGCAACAGGCTAAGCACATGTCA
TATTGAAGGTGATGACCTGCATATCCAGAGGAATGTGCAAAAGCTGAAGGACACAGTGAAAAAGC
TTGGAGAGAGTGGAGAGATCAAAGCAATTGGAGAACTGGATTTGCTGTTTATGTCTCTGAGAAAT
GCCTGCATTTGACCAGAGCAAAGCTGAAAAATGAATAACTAACCCCCTTTCCCTGCTAGAAATAA
CAATTAGATGCCCCAAAGCGATTTTTTTTAACCAAAAGGAAGATGGGAAGCCAAACTCCATCATG
ATGGGTGGATTCCAAATGAACCCCTGCGTTAGTTACAAAGGAAACCAATGCCACTTTTGTTTATA
AGACCAGAAGGTAGACTTTCTAAGCATAGATATTTATTGATAACATTTCATTGTAACTGGTGTTC
TATACACAGAAAACAATTTATTTTTTAAATAATTGTCTTTTTCCATAAAAAAGATTACTTTCCAT

TCCTTTAGGGGAAAAAACCCCTAAATAGCTTCATGTTTCCATAATCAGTACTTTATATTTATAAA
TGTATTTATTATTATTATAAGACTGCATTTTATTTATATCATTTTATTAATATGGATTTATTTAT
AGAAACATCATTCGATATTGCTACTTGAGTGTAAGGCTAATATTGATATTTATGACAATAATTAT
AGAGCTATAACATGTTTATTTGACCTCAATAAACACTTGGATATCCC
FIGURE 154

MAALQKSVSSFLMGTLATSCLLLLALLVQGGAAAPISSHCRLDKSNFQQPYITNRTFMLAKEASL

ADNNTDVRLIGEKLFHGVSMSERCYLMKQVLNFTLEEVLFPQSDRFQPYMQEVVPFLARLSNRLS
TCHIEGDDLHIQRNVQKLKDTVKKLGESGEIKAIGELDLLFMSLRNACI

Important features of the protein: 
Signal peptide: 

amino acids 1-33 



N-glycosylation sites.

amino acids 54-58, 68-72, 97-101



N-myristoylation sites. 

amino acids 14-20, 82-88 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 10-21 
FIGURE 155



GGCTTGCTGAAAATAAAATCAGGACTCCTAACCTGCTCCAGTCAGCCTGCTTCCACGAGGCCTGT 
CAGTCAGTGCCCGACTTGTGACTGAGTGTGCAGTGCCCAGCATGTACCAGGTCAGTGCAGAGGGC 
TGCCTGAGGGCTGTGCTGAGAGGGAGAGGAGCAGAGATGCTGCTGAGGGTGGAGGGAGGCCAAGC 
TGCCAGGTTTGGGGCTGGGGGCCAAGTGGAGTGAGAAACTGGGATCCCAGGGGGAGGGTGCAGAT 
GAGGGAGCGACCCAGATTAGGTGAGGACAGTTCTCTCATTAGCCTTTTCCTACAGGTGGTTGCAT
TCTTGGCAATGGTCATGGGAACCCACACCTACAGCCACTGGCCCAGCTGCTGCCCCAGCAAAGGG 
CAGGACACCTCTGAGGAGCTGCTGAGGTGGAGCACTGTGCCTGTGCCTCCCCTAGAGCCTGCTAG 
GCCCAACCGCCACCCAGAGTCCTGTAGGGCCAGTGAAGATGGACCCCTCAACAGCAGGGCCATCT 
CCCCCTGGAGATATGAGTTGGACAGAGACTTGAACCGGCTCCCCCAGGACCTGTACCACGCCCGT 



TGCCTGTGCCCGCACTGCGTCAGCCTACAGACAGGCTCCCACATGGACCCCCGGGGCAACTCGGA
GCTGCTCTACCACAACCAGACTGTCTTCTACAGGCGGCCATGCCATGGCGAG^^
AGGGCTACTGCCTGGAGCGCAGGCTGTACCGTGTTTCCTTAGCTTGTGTGTGTGTGCGGCCCCGT
GTGATGGGCTAGCCGGACCTGCTGGAGGCTGGTCCCTTTTTGGGAAACCTGGAGCCAGGTGTACA
ACCACTTGCCATGAAGGGCCAGGATGCCCAGATGCTTGGCCCCTGTGAAGTGCTGTCTGGAGCAG
CAGGATCCCGGGACAGGATGGGGGGCTTTGGGGAAAACCTGCACTTCTGCACATTTTGAAAAGAG
CAGCTGCTGCTTAGGGCCGCCGGAAGCTGGTGTCCTGTCATTTTCTCTCAGGAAAGGTTTTCAAA
GTTCTGCCCATTTCTGGAGGCCACCACTCCTGTCTCTTCCTCTTTTCCCATCCCCTGCTACCCTG
GCCCAGCACAGGCACTTTCTAGATATTTCCCCCTTGCTGGAGAAGAAAGAGCCCCTGGTTTTATT

TGTTTGTTTACTCATCACTCAGTGAGCATCTACTTTGGGTGCATTCTAGTGTAGTTACTAGTCTT
TTGACATGGATGATTCTGAGGAGGAAGCTGTTATTGAATGTATAGAGATTTATCCAAATAAATAT 
CTTTATTTAAAAATGAAAAA 
FIGURE 156



MRERPRLGEDSSLISLFLQVVAFLAMVMGTHTYSHWPSCCPSKGQDTSEELLRWSTVPVPPLEPA
RPNRHPESCRASEDGPLNSRAISPWRYELDRDLNRLPQDLYHARCLCPHCVSLQTGSHMDPRGNS 
ELLYHNQTVFYRRPCHGEKGTHKGYCLERRLYRVSLACVCVRPRVMG 

Important features of the protein: 
Signal peptide: 

amino acids 1-32 

N-glycosylation site. 

amino acids 136-140 

Tyrosine kinase phosphorylation site. 

amino acids 127-135 

N-myristoylation sites. 

amino acids 44-50, 150-156 
FIGURE 157



CCGGCGATCTCGCTCGTGCTGCTAAGCCTGGCCGCGCTGTGCAGGAGCGCCGTACCCCGAGAGCC 
GACCGTTCAATGTGGCTCTGAAACTGGGCCATCTCCAGAGTGGATGCTACAACATGATCTAATCC
CCGGAGACTTGAGGGACCTCCGAGTAGAACCTGTTACAACTAGTGTTGCAACAGGGGACTATTCA
ATTTTGATGAATGTAAGCTGGGTACTCCGGGCAGATGCCAGCATCCGCTTGTTGAAGGCCACCAA 
GATTTGTGTGACGGGCAAAAGCAACTTCCAGTCCTACAGCTGTGTGAGGTGCAATTACACAGAGG
CCTTCCAGACTCAGACCAGACCCTCTGGTGGTAAATGGACATTTTCCTACATCGGCTTCCCTGTA 
GAGCTGAACACAGTCTATTTCATTGGGGCCCATAATATTCCTAATGCAAATATGAATGAAGATGG
CCCTTCCATGTCTGTGAATTTCACCTCACCAGGCTGCCTAGACCACATAATGAAATATAAAAAAA 
AGTGTGTCAAGGCCGGAAGCCTGTGGGATCCGAACATCACTGCTTGTAAGAAGAATGAGGAGACA
GTAGAAGTGAACTTCACAACCACTCCCCTGGGAAACAGATACATGGCTCTTATCCAACACAGCAC
TATCATCGGGTTTTCTCAGGTGTTTGAGCCACACCAGAAGAAACAAACGCGAGCTTCAGTGGTGA
TTCCAGTGACTGGGGATAGTGAAGGTGCTACGGTGCAGCTGACTCCATATTTTCCTACTTGTGGC 
AGCGACTGCATCCGACATAAAGGAACAGTTGTGCTCTGCCCACAAACAGGCGTCCCTTTCCCTCT 
GGATAACAACAAAAGCAAGCCGGGAGGCTGGCTGCCTCTCCTCCTGCTGTCTCTGCTGGTGGCCA 
CATGGGTGCTGGTGGCAGGGATCTATCTAATGTGGAGGCACGAAAGGATCAAGAAGACTTCCTTT 
TCTACCACCACACTACTGCCCCCCATTAAGGTTCTTGTGGTTTACCCATCTGAAATATGTTTCCA 
TCACACAATTTGTTACTTCACTGAATTTCTTCAAAACCATTGCAGAAGTGAGGTCATCCTTGAAA 
AGTGGCAGAAAAAGAAAATAGCAGAGATGGGTCCAGTGCAGTGGCTTGCCACTCAAAAGAAGGCA
GCAGACAAAGTCGTCTTCCTTCTTTCCAATGACGTCAACAGTGTGTGCGATGGTACCTGTGGCAA 
GAGCGAGGGCAGTCCCAGTGAGAACTCTCAAGACCTCTTCCCCCTTGCCTTTAACCTTTTCTGCA 
GTGATCTAAGAAGCCAGATTCATCTGCACAAATACGTGGTGGTCTACTTTAGAGAGATTGATACA 
AAAGACGATTACAATGCTCTCAGTGTCTGCCCCAAGTACCACCTCATGAAGGATGCCACTGCTTT 
CTGTGCAGAACTTCTCCATGTCAAGCAGCAGGTGTCAGCAGGAAAAAGATCACAAGCCTGCCACG
ATGGCTGCTGCTCCTTGTAG 
FIGURE 158

MSLVLLSLAALCRSAVPREPTVQCGSETGPSPEWMLQHDLIPGDLRDLRVEPVTTSVATGDYSILMNVSWV
LRADASIRLLKATKICVTGKSNFQSYSCVRCNYTEAFQTQTRPSGGKWTFSYIGFPVELNTVYFIGAHNIP
NANMNEDGPSMSVNFTSPGCLDHIMKYKKKCVKAGSLWDPNITACKKNEETVEVNFTTTPLGNRYMALIQH
STIIGFSQVFEPHQKKQTRASVVIPVTGDSEGATVQLTPYFPTCGSDCIRHKGTVVLCPQTGVPFPLDNNK
SKPGGWLPLLLLSLLVATWVLVAGIYLMWRHERIKKTSFSTTTLLPPIKVLVVYPSEICFHHTICYFTEFL
QNHCRSEVILEKWQKKKIAEMGPVQWLATQKKAADKVVFLLSNDVNSVCDGTCGKSEGSPSENSQDLFPLA
FNLFCSDLRSQIHLHKYVVVYFREIDTKDDYNALSVCPKYHLMKDATAFCAELLHVKQQVSAGKRSQACHD
GCCSL

Important features of the protein: 
Signal peptide: 
amino acids 1-14 

Transmembrane domain: 

amino acids 290-309 

N-glycosylation sites. 

amino acids 67-71, 103-107, 156-160, 183-187, 197-201 and 283-287

cAMP- and cGMP-dependent protein kinase phosphorylation sites. 

amino acids 228-232 and 319-323

Casein kinase II phosphorylation sites. 

amino acids 178-182, 402-406, 414-418 and 453-457

N-myristoylation site. 

amino acids 116-122 

Amidation site. 

amino acids 488-492
FIGURE 159



AGCCACCAGCGCAACATGACAGTGAAGACCCTGCATGGCCCAGCCATGGTCAAGTACTTGCTGCT 
GTCGATATTGGGGCTTGCCTTTCTGAGTGAGGCGGCAGCTCGGAAAATCCCCAAAGTAGGACATA 
CTTTTTTCCAAAAGCCTGAGAGTTGCCCGCCTGTGCCAGGAGGTAGTATGAAGCTTGACATTGGC 
ATCATCAATGAAAACCAGCGCGTTTCCATGTCACGTAACATCGAGAGCCGCTCCACCTCCCCCTG 
GAATTACACTGTCACTTGGGACCCCAACCGGTACCCCTCGGAAGTTGTACAGGCCCAGTGTAGGA 
ACTTGGGCTGCATCAATGCTCAAGGAAAGGAAGACATCTCCATGAATTCCGTTCCCATCCAGCAA 
GAGACCCTGGTCGTCCGGAGGAAGCACCAAGGCTGCTCTGTTTCTTTCCAGTTGGAGAAGGTGCT 
GGTGACTGTTGGCTGCACCTGCGTCACCCCTGTCATCCACCATGTGCAGTAAGAGGTGCATATCC
ACTCAGCTGAAGAAG
FIGURE 160

MTVKTLHGPAMVKYLLLSILGLAFLSEAAARKIPKVGHTFFQKPESCPPVPGGSMKLDIGIINEN
QRVSMSRNIESRSTSPWNYTVTWDPNRYPSEVVQAQCRNLGCINAQGKEDISMNSVPIQQETLVV
RRKHQGCSVSFQLEKVLVTVGCTCVTPVIHHVQ

Signal sequence: 

amino acids 1-30 



N-glycosylation site.

amino acids 83-87

N-myristoylation sites. 

amino acids 106-111, 136-141 
FIGURE 161



ACACTGGCCAAACAAAAACGAAAGCACTCCGTGCTGGAAGTAGGAGGAGAGTCAGGACTCCCAGG
ACAGAGAGTGCACAAACTACCCAGCACAGCCCCCTCCGCCCCCTCTGGAGGCTGAAGAGGGATTC 
CAGCCCCTGCCACCCACAGACACGGGCTGACTGGGGTGTCTGCCCCCCTTGGGGGGGGGCAGCAC 
AGGGCCTCAGGCCTGGGTGCCACCTGGCACCTAGAAGATGCCTGTGCCCTGGTTCTTGCTGTCCT 
TGGCACTGGGCCGAAGCCCAGTGGTCCTTTCTCTGGAGAGGCTTGTGGGGCCTCAGGACGCTACC 
CACTGCTCTCCGGGCCTCTCCTGCCGCCTCTGGGACAGTGACATACTCTGCCTGCCTGGGGACAT 
CGTGCCTGCTCCGGGCCCCGTGCTGGCGCCTACGCACCTGCAGACAGAGCTGGTGCTGAGGTGCC 
AGAAGGAGACCGACTGTGACCTCTGTCTGCGTGTGGCTGTCCACTTGGCCGTGCATGGGCACTGG 
GAAGAGCCTGAAGATGAGGAAAAGTTTGGAGGAGCAGCTGACTCAGGGGTGGAGGAGCCTAGGAA
TGCCTCTCTCCAGGCCCAAGTCGTGCTCTCCTTCCAGGCCTACCCTACTGCCCGCTGCGTCCTGC 
TGGAGGTGCAAGTGCCTGCTGCCCTTGTGCAGTTTGGTCAGTCTGTGGGCTCTGTGGTATATGAC
TGCTTCGAGGCTGCCCTAGGGAGTGAGGTACGAATCTGGTCCTATACTCAGCCCAGGTACGAGAA
GGAACTCAACCACACACAGCAGCTGCCTGCCCTGCCCTGGCTCAACGTGTCAGCAGATGGTGACA
ACGTGCATCTGGTTCTGAATGTCTCTGAGGAGCAGCACTTCGGCCTCTCCCTGTACTGGAATCAG
GTCCAGGGCCCCCCAAAACCCCGGTGGCACAAAAACCTGACTGGACCGCAGATCATTACCTTGAA
CCACACAGACCTGGTTCCCTGCCTCTGTATTCAGGTGTGGCCTCTGGAACCTGACTCCGTTAGGA
CGAACATCTGCCCCTTCAGGGAGGACCCCCGCGCACACCAGAACCTCTGGCAAGCCGCCCGACTG
CGACTGCTGACCCTGCAGAGCTGGCTGCTGGACGCACCGTGCTCGCTGCCCGCAGAAGCGGCACT
GTGCTGGCGGGCTCCGGGTGGGGACCCCTGCCAGCCACTGGTCCCACCGCTTTCCTGGGAGAACG
TCACTGTGGACAAGGTTCTCGAGTTCCCATTGCTGAAAGGCCACCCTAACCTCTGTGTTCAGGTG 
AACAGCTCGGAGAAGCTGCAGCTGCAGGAGTGCTTGTGGGCTGACTCCCTGGGGCCTCTCAAAGA 
CGATGTGCTACTGTTGGAGACACGAGGCCCCCAGGACAACAGATCCCTCTGTGCCTTGGAACCCA
GTGGCTGTACTTCACTACCCAGCAAAGCCTCCACGAGGGCAGCTCGCCTTGGAGAGTACTTACTA
CAAGACCTGCAGTCAGGCCAGTGTCTGCAGCTATGGGACGATGACTTGGGAGCGCTATGGGCCTG
CCCCATGGACAAATACATCCACAAGCGCTGGGCCCTCGTGTGGCTGGCCTGCCTACTCTTTGCCG 
CTGCGCTTTCCCTCATCCTCCTTCTCAAAAAGGATCACGCGAAAGGGTGGCTGAGGCTCTTGAAA
CAGGACGTCCGCTCGGGGGCGGCCGCCAGGGGCCGCGCGGCTCTGCTCCTCTACTCAGCCGATGA
CTCGGGTTTCGAGCGCCTGGTGGGCGCCCTGGCGTCGGCCCTGTGCCAGCTGCCGCTGCGCGTGG
CCGTAGACCTGTGGAGCCGTCGTGAACTGAGCGCGCAGGGGCCCGTGGCTTGGTTTCACGCGCAG 
CGGCGCCAGACCCTGCAGGAGGGCGGCGTGGTGGTCTTGCTCTTCTCTCCCGGTGCGGTGGCGCT 
GTGCAGCGAGTGGCTACAGGATGGGGTGTCCGGGCCCGGGGCGCACGGCCCGCACGACGCCTTCC 
GCGCCTCGCTCAGCTGCGTGCTGCCCGACTTCTTGCAGGGCCGGGCGCCCGGCAGCTACGTGGGG 
GCCTGCTTCGACAGGCTGCTCCACCCGGACGCCGTACCCGCCCTTTTCCGCACCGTGCCCGTCTT 
CACACTGCCCTCCCAACTGCCAGACTTCCTGGGGGCCCTGCAGCAGCCTCGCGCCCCGCGTTCCG 
GGCGGCTCCAAGAGAGAGCGGAGCAAGTGTCCCGGGCCCTTCAGCCAGCCCTGGATAGCTACTTC 
CATCCCCCGGGGACTCCCGCGCCGGGACGCGGGGTGGGACCAGGGGCGGGACCTGGGGCGGGGGA 
CGGGACTTAAATAAAGGCAGACGCTGTTTTTCTAAAAAAA 






FIGURE 162 






MPVPWFLLSLALGRSPVVLSLERLVGPQDATHCSPGLSCRLWDSDILCLPGDIVPAPGPVLAPTHLQTELV
LRCQKETDCDLCLRVAVHLAVHGHWEEPEDEEKFGGAADSGVEEPRNASLQAQVVLSFQAYPTARCVLLEV
QVPAALVQFGQSVGSVVYDCFEAALGSEVRIWSYTQPRYEKELNHTQQLPALPWLNVSADGDNVHLVLNVS
EEQHFGLSLYWNQVQGPPKPRWHKNLTGPQIITLNHTDLVPCLCIQVWPLEPDSVRTNICPFREDPRAHQN
LWQAARLRLLTLQSWLLDAPCSLPAEAALCWRAPGGDPCQPLVPPLSWENVTVDKVLEFPLLKGHPNLCVQ
VNSSEKLQLQECLWADSLGPLKDDVLLLETRGPQDNRSLCALEPSGCTSLPSKASTRAARLGEYLLQDLQS
GQCLQLWDDDLGALWACPMDKYIHKRWALVWLACLLFAAALSLILLLKKDHAKGWLRLLKQDVRSGAAARG
RAALLLYSADDSGFERLVGALASALCQLPLRVAVDLWSRRELSAQGPVAWFHAQRRQTLQEGGVVVLLFSP
GAVALCSEWLQDGVSGPGAHGPHDAFRASLSCVLPDFLQGRAPGSYVGACFDRLLHPDAVPALFRTVPVFT
LPSQLPDFLGALQQPRAPRSGRLQERAEQVSRALQPALDSYFHPPGTPAPGRGVGPGAGPGAGDGT

Signal sequence: 

amino acids 1-20 

Transmembrane domain. 

amino acids 453-475 

N-glycosylation sites. 

amino acids 118-121, 186-189, 198-201, 211-214, 238-241, 248-251, 
334-337, 357-360, 391-394 

Glycosaminoglycan attachment site. 

amino acids 583-586 



cAMP- and cGMP-dependent protein kinase phosphorylation site.

amino acids 552-555 

N-myristoylation sites. 

amino acids 107-112, 152-157, 319-324, 438-443, 516-521, 612-617, 
692-697, 696-701, 700-705 
FIGURE 163



GGGAGGGCTCTGTGCCAGCCCCGATGAGGACGCTGCTGACCATCTTGACTGTGGGATCCCTGGCT 
GCTCACGCCCCTGAGGACCCCTCGGATCTGCTCCAGCACGTGAAATTCCAGTCCAGCAACTTTGA 
AAACATCCTGACGTGGGACAGCGGGCCAGAGGGCACCCCAGACACGGTCTACAGCATCGAGTATA
AGACGTACGGAGAGAGGGACTGGGTGGCAAAGAAGGGCTGTCAGCGGATCACCCGGAAGTCCTGC 
AACCTGACGGTGGAGACGGGCAACCTCACGGAGCTCTACTATGCCAGGGTCACCGCT
GTCAGTGCGGGAGGCCGGTCAGCCACCAAGATGACTGACAGGTTCAGCTCTCTGCAGCACACTAC
CCTCAAGCCACCTGATGTGACCTGTATCTCCAAAGTGAGATCGATTCAGATGATTGTTCATCCTA 
CCCCCACGCCAATCCGTGCAGGCGATGGCCACCGGCTAACCCTGGAAGACATCTTCCATGACCTG 
TTCTACCACTTAGAGCTCCAGGTCAACCGCACCTACCAAATGCACCTTGGAGGGAAGCAGAGAGA
ATATGAGTTCTTCGGCCTGACCCCTGACACAGAGTTCCTTGGCACCATCATGATTTGCGTTCCCA 
CCTGGGCCAAGGAGAGTGCCCCCTACATGTGCCGAGTGAAGACACTGCCAGACCGGACATGGACC
TACTCCTTCTCCGGAGCCTTCCTGTTCTCCATGGGCTTCCTCGTCGCAGTACTCTGCTACCTGAG
CTACAGATATGTCACCAAGCCGCCTGCACCTCCCAACTCCCTGAACGTCCAGCGAGTCCTGACTT
TCCAGCCGCTGCGCTTCATCCAGGAGCACGTCCTGATCCCTGTCTTTGACCTCAGCGGCCCCAGC
AGTCTGGCCCAGCCTGTCCAGTACTCCCAGATCAGGGTGTCTGGACCCAGGGAGCCCGCAGGAGC
TCCACAGCGGCATAGCCTGTCCGAGATCACCTACTTAGGGCAGCCAGACATCTCCATCCTCCAGC
CCTCCAACGTGCCACCTCCCCAGATCCTCTCCCCACTGTCCTATGCCCCAAACGCTGCCCCTGAG
GTCGGGCCCCCATCCTATGCACCTCAGGTGACCCCCGAAGCTCAATTCCCATTCTACGCCCCACA
GGCCATCTCTAAGGTCCAGCCTTCCTCCTATGCCCCTCAAGCCACTCCGGACAGCTGGCCTCCCT
CCTATGGGGTATGCATGGAAGGTTCTGGCAAAGACTCCCCCACTGGGACACTTTCTAGTCCTAAA
CACCTTAGGCCTAAAGGTCAGCTTCAGAAAGAGCCACCAGCTGGAAGCTGCATGTTAGGTGGCCT
TTCTCTGCAGGAGGTGACCTCCTTGGCTATGGAGGAATCCCAAGAAGCAAAATCATTGCACCAGC
CCCTGGGGATTTGCACAGACAGAACATCTGACCCAAATGTGCTACACAGTGGGGAGGAAGGGACA
CCACAGTACCTAAAGGGCCAGCTCCCCCTCCTCTCCTCAGTCCAGATCGAGGGCCACCCCATGTC
CCTCCCTTTGCAACCTCCTTCCGGTCCATGTTCCCCCTCGGACCAAGGTCCAAGTCCCTGGGGCC
TGCTGGAGTCCCTTGTGTGTCCCAAGGATGAAGCCAAGAGCCCAGCCCCTGAGACCTCAGACCTG
GAGCAGCCCACAGAACTGGATTCTCTTTTCAGAGGCCTGGCCCTGACTGTGCAGTGGGAGTCCTG
AGGGGAATGGGAAAGGCTTGGTGCTTCCTCCCTGTCCCTACCCAGTGTCACATCCTTGGCTGTCA
ATCCCATGCCTGCCCATGCCACACACTCTGCGATCTGGCCTCAGACGGGTGCCCTTGAGAGAAGC
AGAGGGAGTGGCATGCAGGGCCCCTGCCATGGGTGCGCTCCTCACCGGAACAAAGCAGCATGATA 
AGGACTGCAGCGGGGGAGCTCTGGGGAGCAGCTTGTGTAGACAAGCGCGTGCTCGCTGAGCCCTG 
CAAGGCAGAAATGACAGTGCAAGGAGGAAATGCAGGGAAACTCCCGAGGTCCAGAGCCCCACCTC 
CTAACACCATGGATTCAAAGTGCTCAGGGAATTTGCCTCTCCTTGCCCCATTCCTGGCCAGTTTC 
ACAATCTAGCTCGACAGAGCATGAGGCCCCTGCCTCTTCTGTCATTGTTCAAAGGTGGGAAGAGA 
GCCTGGAAAAGAACCAGGCCTGGAAAAGAACCAGAAGGAGGCTGGGCAGAACCAGAACAACCTGC 
ACTTCTGCCAAGGCCAGGGCCAGCAGGACGGCAGGACTCTAGGGAGGGGTGTGGCCTGCAGCTCA 
TTCCCAGCCAGGGCAACTGCCTGACGTTGCACGATTTCAGCTTCATTCCTCTGATAGAACAAAGC 
GAAATGCAGGTCCACCAGGGAGGGAGACACACAAGCCTTTTCTGCAGGCAGGAGTTTCAGACCCT
ATCCTGAGAATGGGGTTTGAAAGGAAGGTGAGGGCTGTGGCCCCTGGACGGGTACAATAACACAC 
TGTACTGATGTCACAACTTTGCAAGCTCTGCCTTGGGTTCAGCCCATCTGGGCTCAAATTCCAGC 
CTCACCACTCACAAGCTGTGTGACTTCAAACAAATGAAATCAGTGCCCAGAACCTCGGTTTCCTC 
ATCTGTAATGTGGGGATCATAACACCTACCTCATGGAGTTGTGGTGAAGATGAAATGAAGTCATG 
TCTTTAAAGTGCTTAATAGTGCCTGGTACATGGGCAGTGCCCAATAAACGGTAGCTATTTAAAAA 
AAAAAAAA 
FIGURE 164



MRTLLTILTVGSLAAHAPEDPSDLLQHVKFQSSNFENILTWDSGPEGTPDTVYSIEYKTYGERDW 
VAKKGCQRITRKSCNLTVETGNLTELYYARVTAVSAGGRSATKMTDRFSSLQHTTLKPPDVTCIS
KVRSIQMIVHPTPTPIRAGDGHRLTLEDIFHDLFYHLELQVNRTYQMHLGGKQREYEFFGLTPDT 
EFLGTIMICVPTWAKESAPYMCRVKTLPDRTWTYSFSGAFLFSMGFLVAVLCYLSYRYVTKPPAP 
PNSLNVQRVLTFQPLRFIQEHVLIPVFDLSGPSSLAQPVQYSQIRVSGPREPAGAPQRHSLSEIT 
YLGQPDISILQPSNVPPPQILSPLSYAPNAAPEVGPPSYAPQVTPEAQFPFYAPQAISKVQPSSY 
APQATPDSWPPSYGVCMEGSGKDSPTGTLSSPKHLRPKGQLQKEPPAGSCMLGGLSLQEVTSIAM 
EESQEAKSLHQPLGICTDRTSDPNVLHSGEEGTPQYLKGQLPLLSSVQIEGHPMSLPLQPPSGPC 
SPSDQGPSPWGLLESLVCPKDEAKSPAPETSDLEQPTELDSLFRGLALTVQWES 

Signal sequence. 

amino acids 1-17 

Transmembrane domain. 

amino acids 233-250 

N-glycosylation sites. 

amino acids 80-83, 87-90, 172-175 

N-myristoylation sites. 

amino acids 11-16, 47-52, 102-107, 531-536, 565-570 
FIGURE 165

TGGCCTACTGGAAAAAAAAAAAAAAAAAAAAAAAGTCACCCGGGCCCGCGGTGGCCACAACATGG
CTGCGGCGCCGGGGCTGCTCTTCTGGCTGTTCGTGCTGGGGGCGCTCTGGTGGGTCCCGGGCCAG 
TCGGATCTCAGCCACGGACGGCGTTTCTCGGACCTCAAAGTGTGCGGGGACGAAGAGTGCAGCAT 
GTTAATGTACCGTGGGAAAGCTCTTGAAGACTTCACGGGCCCTGATTGTCGTTTTGTGAATTTTA 
AAAAAGGTGACGATGTATATGTCTACTACAAACTGGCAGGGGGATCCCTTGAACTTTGGGCTGGA 
AGTGTTGAACACAGTTTTGGATATTTTCCAAAAGATTTGATCAAGGTACTTCATAAATACACGGA
AGAAGAGCTACATATTCCAGCAGATGAGACAGACTTTGTCTGCTTTGAAGGAGGAAGAGATGATT
TTAATAGTTATAATGTAGAAGAGCTTTTAGGATCTTTGGAACTGGAGGACTCTGTACCTGAAGAG
TCGAAGAAAGCTGAAGAAGTTTCTCAGCACAGAGAGAAATCTCCTGAGGAGTCTCGGGGGCGTGA
ACTTGACCCTGTGCCTGAGCCCGAGGCATTCAGAGCTGATTCAGAGGATGGAGAAGGTGCTTTCT
CAGAGAGCACCGAGGGGCTGCAGGGACAGCCCTCAGCTCAGGAGAGCCACCCTCACACCAGCGGT 
CCTGCGGCTAACGCTCAGGGAGTGCAGTCTTCGTTGGACACTTTTGAAGAAATTCTGCACGATAA 
ATTGAAAGTGCCGGGAAGCGAAAGCAGAACTGGCAATAGTTCTCCTGCCTCGGTGGAGCGGGAGA 
AGACAGATGCTTACAAAGTCCTGAAAACAGAAATGAGTCAGAGAGGAAGTGGACAGTGCGTTATT
CATTACAGCAAAGGATTTCGTTGGCATCAAAATCTAAGTTTGTTTTACAAAGATTGTTTTTAGTA
CTAAGCTGCCTTGGCAGTTTGCATTTTTGAGCCAAACAAAAATATATTATTTTCCCTTCTAAGTA 
AAAAAAAAAAAAAAAAAAAA 
FIGURE 166

MAAAPGLLFWLFVLGALWWVPGQSDLSHGRRFSDLKVCGDEECSMLMYRGKALEDFTGPDCRFVN
FKKGDDVYVYYKLAGGSLELWAGSVEHSFGYFPKDLIKVLHKYTEEELHIPADETDFVCFEGGRD
DFNSYNVEELLGSLELEDSVPEESKKAEEVSQHREKSPEESRGRELDPVPEPEAFRADSEDGEGA
FSESTEGLQGQPSAQESHPHTSGPAANAQGVQSSLDTFEEILHDKLKVPGSESRTGNSSPASVER
EKTDAYKVLKTEMSQRGSGQCVIHYSKGFRWHQNLSLFYKDCF 

Important features of the protein: 
Signal peptide: 

amino acids 1-22 



cAMP- and cGMP-dependent protein kinase phosphorylation site.

amino acids 30-34

N-glycosylation site.

amino acids 294-298

Tyrosine kinase phosphorylation site.

amino acids 67-76

N-myristoylation sites.

amino acids 205-211, 225-231, 277-283

Amidation site.

amino acids 28-32






FIGURE 167 



CCAGGACCAGGGCGCACCGGCTCAGCCTCTCACTTGTCAGAGGCCGGGGAAGAGAAGCAAAGCGC 
AACGGTGTGGTCCAAGCCGGGGCTTCTGCTTCGCCTCTAGGACATACACGGGACCCCCTAACTTC 
AGTCCCCCAAACGCGCACCCTCGAAGTCTTGAACTCCAGCCCCGCACATCCACGCGCGGCACAGG 
CGCGGCAGGCGGCAGGTCCCGGCCGAAGGCGATGCGCGCAGGGGGTCGGGCAGCTGGGCTCGGGC 
GGCGGGAGTAGGGCCCGGCAGGGAGGCAGGGAGGCTGCATATTCAGAGTCGCGGGCTGCGCCCTG 
GGCAGAGGCCGCCCTCGCTCCACGCAACACCTGCTGCTGCCACCGCGCCGCG ATG AGCCGCGTGG 
TCTCGCTGCTGCTGGGCGCCGCGCTGCTCTGCGGCCACGGAGCCTTCTGCCGCCGCGTGGTCAGC 
GGCCAAAAGGTGTGTTTTGCTGACTTCAAGCATCCCTGCTACAAAATGGCCTACTTCCATGAACT 
GTCCAGCCGAGTGAGCTTTCAGGAGGCACGCCTGGCTTGTGAGAGTGAGGGAGGAGTCCTCCTCA 
GCCTTGAGAATGAAGCAGAACAGAAGTTAATAGAGAGCATGTTGCAAAACCTGACAAAACCCGGG 
ACAGGGATTTCTGATGGTGATTTCTGGATAGGGCTTTGGAGGAATGGAGATGGGCAAACATCTGG 
TGCCTGCCCAGATCTCTACCAGTGGTCTGATGGAAGCAATTCCCAGTACCGAAACTGGTACACAG 
ATGAACCTTCCTGCGGAAGTGAAAAGTGTGTTGTGATGTATCACCAACCAACTGCCAATCCTGGC
CTTGGGGGTCCCTACCTTTACCAGTGGAATGATGACAGGTGTAACATGAAGCACAATTATATTTG 
CAAGTATGAACCAGAGATTAATCCAACAGCCCCTGTAGAAAAGCCTTATCTTACAAATCAACCAG 
GAGACACCCATCAGAATGTGGTTGTTACTGAAGCAGGTATAATTCCCAATCTAATTTATGTTGTT 
ATACCAACAATACCCCTGCTCTTACTGATACTGGTTGCTTTTGGAACCTGTTGTTTCCAGATGCT 
GCATAAAAGTAAAGGAAGAACAAAAACTAGTCCAAACCAGTCTACACTGTGGATTTCAAAGAGTA 
CCAGAAAAGAAAGTGGCATGGAAGT ATAA TAACTCATTGACTTGGTTCCAGAATTTTGTAATTCT 
GGATCTGTATAAGGAATGGCATCAGAACAATAGCTTGGAATGGCTTGAAATCACAAAGGATCTGC 
AAGATGAACTGTAAGCTCCCCCTTGAGGCAAATATTAAAGTAATTTTTATATGTCTATTATTTCA 
TTTAAAGAATATGCTGTGCTAATAATGGAGTGAGACATGCTTATTTTGCTAAAGGATGCACCCAA
ACTTCAAACTTCAAGCAAATGAAATGGACAATGCAGATAAAGTTGTTATCAACACGTCGGGAGTA
TGTGTGTTAGAAGCAATTCCTTTTATTTCTTTCACCTTTCATAAGTTGTTATCTAGTCAATGTAA 
TGTATATTGTATTGAAATTTACAGTGTGCAAAAGTATTTTACCTTTGCATAAGTGTTTGATAAAA
ATGAACTGTTCTAATATTTATTTTTATGGCATCTCATTTTTCAATACATGCTCTTTTGATTAAAG
AAACTTATTACTGTTGTCAACTGAATTCACACACACACAZ^lTATAGTACCATAGAAAAAGTTTGT 
TTTCTCGAAATAATTCATCTTTCAGCTTCTCTGCTTTTGGTCAATGTCTAGGAAATCTCTTCAGA 
AATAAGAAGCTATTTCATTAAGTGTGATATAAACCTCCTCAAACATTTTACTTAGAGGCAAGGAT 
TGTCTAATTTCAATTGTGCAAGACATGTGCCTTATAATTATTTTTAGCTTAAAATTAAACAGATT 
TTGTAATAATGTAACTTTGTTAATAGGTGCATAAACACTAATGCAGTCAATTTGAACAAAAGAAG 
TGACATACACAATATAAATCATATGTCTTCACACGTTGCCTATATAATGAGAAGCAGCTCTCTGA 
GGGTTCTGAAATCAATGTGGTCCCTCTCTTGCCCACTAAACAAAGATGGTTGTTCGGGGTTTGGG 
ATTGACACTGGAGGCAGATAGTTGCATAGTTAGTCTAAGGTTTCCCTAGCTGTATTTAGCCTCTG
ACTATATTAGTATACAAAGAGGTCATGTGGTTGAGACCAGGTGAATAGTCACTATCAGTGTGGAG 
ACAAGCACAGCACACAGACAT T T TAGGAAGGAAAGGAAC TAC GAAAT CGT GTGAAAAT GGGT T GG 
AACCCATCAGTGATCGCATATTCATTGATGAGGGTTTGCTTGAGATAGAAAATGGTGGCTCCTTT 
CTGTCTTATCTCCTAGTTTCTTCAATGCTTACGCCTTGTTCTTCTCAAGAGAAAGTTGTAACTCT 
CTGGTCTTCATATGTCCCTGTGCTCCTTTTAACCAAATAAAGAGTTCTTGTTTCTGGGGGAAAAA
AAAAAAAAAAAA?\AAA?yW^UW^AAAAAAZyW^ 
FIGURE 168

MSRVVSLLLGAALLCGHGAFCRRVVSGQKVCFADFKHPCYKMAYFHELSSRVSFQEARLACESE
GGVLLSLENEAEQKLIESMLQNLTKPGTGISDGDFWIGLWRNGDGQTSGACPDLYQWSDGSNSQ
YRNWYTDEPSCGSEKCVVMYHQPTANPGLGGPYLYQWNDDRCNMKHNYICKYEPEINPTAPVEK
PYLTNQPGDTHQNVVVTEAGIIPNLIYVVIPTIPLLLLILVAFGTCCFQMLHKSKGRTKTSPNQ
STLWISKSTRKESGMEV

Important features of the protein: 
Signal peptide: 

amino acids 1-21 

Transmembrane domain: 

amino acids 214-235 

N-glycosylation sites. 

amino acids 86-89, 255-258



cAMP- and cGMP-dependent protein kinase phosphorylation site.

amino acids 266-269



N-myristoylation sites. 

amino acids 27-32, 66-71, 91-96, 93-98, 102-107, 109-114, 140-145,
212-217